COMPOSITIONS, SPLICE VARIANTS AND METHODS
RELATING TO COLON SPECIFIC GENES AND PROTEINS 

INTRODUCTION

This application claims the benefit of priority from U.S. Provisional Patent 
Application Serial No. 60/431,133 filed December 4, 2002, which is herein incorporated 
by reference in its entirety. 

FIELD OF THE INVENTION 

The present invention relates to newly identified nucleic acids and polypeptides
present in normal and neoplastic colon cells, including fragments, variants and derivatives
of the nucleic acids and polypeptides. The present invention also relates to antibodies to
the polypeptides of the invention, as well as agonists and antagonists of the polypeptides
of the invention. The invention also relates to compositions comprising the nucleic acids,
polypeptides, antibodies, post-translational modifications (PTMs), variants, derivatives,
agonists and antagonists thereof and methods for the use of these compositions. These
uses include identifying, diagnosing, monitoring, staging, imaging and treating colon
cancer and/or non-cancerous disease states in colon, identifying and monitoring colon
tissue, and identifying and/or designing agonists and antagonists of polypeptides of
the invention. The uses also include gene therapy, therapeutic molecules including but not
limited to antibodies or antisense molecules, production of transgenic animals and cells,
and production of engineered colon tissue for treatment and research.

BACKGROUND OF THE INVENTION 
Colorectal cancer is the second most common cause of cancer death in the United
States and the third most prevalent cancer in both men and women. M. L. Davila & A. D.
Davila, Screening for Colon and Rectal Cancer, in Colon and Rectal Cancer 47 (Peter S.
Edelstein ed., 2000). The American Cancer Society estimates that there will be about
105,500 new cases of colon cancer and 42,000 new cases of rectal cancer in 2003 in the
United States. Colon cancer and rectal cancer will cause about 57,100 deaths combined.
ACS Website: cancer.org on the world wide web. Nearly all cases of colorectal cancer
arise from adenomatous polyps, some of which mature into large polyps, undergo
abnormal growth and development, and ultimately progress into cancer. Davila at 55-56.
This progression appears to take at least 10 years in most patients, rendering it a
readily treatable form of cancer if diagnosed early, when the cancer is localized. Davila at
56; Walter J. Burdette, Cancer: Etiology, Diagnosis, and Treatment 125 (1998).

Although our understanding of the etiology of colon cancer is undergoing
continual refinement, extensive research in this area points to a combination of factors,
including age, hereditary and nonhereditary conditions, and environmental/dietary factors.
Age is a key risk factor in the development of colorectal cancer, Davila at 48, with men
and women over 40 years of age becoming increasingly susceptible to that cancer,
Burdette at 126. Incidence rates increase considerably in each subsequent decade of life.
Davila at 48. A number of hereditary and nonhereditary conditions have also been linked
to a heightened risk of developing colorectal cancer, including familial adenomatous
polyposis (FAP), hereditary nonpolyposis colorectal cancer (Lynch syndrome or HNPCC),
a personal and/or family history of colorectal cancer or adenomatous polyps, inflammatory
bowel disease, diabetes mellitus, and obesity. Davila at 47; Henry T. Lynch & Jane F.
Lynch, Hereditary Nonpolyposis Colorectal Cancer (Lynch Syndromes), in Colon and
Rectal Cancer 67-68 (Peter S. Edelstein ed., 2000).

Environmental/dietary factors associated with an increased risk of colorectal
cancer include a high-fat diet, high dietary intake of red meat, and a sedentary lifestyle.
Davila at 47; Reddy, B. S., Prev. Med. 16(4): 460-7 (1987). Conversely,
environmental/dietary factors associated with a reduced risk of colorectal cancer include a
diet high in fiber, folic acid, and calcium, as well as hormone-replacement therapy in
postmenopausal women. Davila at 50-55. The effect of antioxidants in reducing the risk of
colon cancer is unclear. Davila at 53.

Because colon cancer is highly treatable when detected at an early, localized stage,
screening should be a part of routine care for all adults starting at age 50, especially those
with first-degree relatives with colorectal cancer. One major advantage of colorectal
cancer screening over its counterparts in other types of cancer is its ability not only to
detect precancerous lesions, but to remove them as well. Davila at 56. The key colorectal
cancer screening tests in use today are the fecal occult blood test, sigmoidoscopy,
colonoscopy, double-contrast barium enema, and the carcinoembryonic antigen (CEA)
test. Burdette at 125; Davila at 56.

The fecal occult blood test (FOBT) screens for colorectal cancer by detecting the
amount of blood in the stool, the premise being that neoplastic tissue, particularly
malignant tissue, bleeds more than typical mucosa, with the amount of bleeding increasing
with polyp size and cancer stage. Davila at 56-57. While effective at detecting early stage
tumors, FOBT is unable to detect adenomatous polyps (premalignant lesions), and,
depending on the contents of the fecal sample, is subject to rendering false positives.
Davila at 56-59. Sigmoidoscopy and colonoscopy, by contrast, allow direct visualization
of the bowel, and enable one to detect, biopsy, and remove adenomatous polyps. Davila at
59-60, 61. Despite the advantages of these procedures, there are accompanying
downsides: sigmoidoscopy, by definition, is limited to the sigmoid colon and below,
colonoscopy is a relatively expensive procedure, and both share the risk of possible bowel
perforation and hemorrhaging. Davila at 59-60. Double-contrast barium enema (DCBE)
enables detection of lesions better than FOBT, and almost as well as colonoscopy, but it
may be limited in evaluating the winding rectosigmoid region. Davila at 60. The CEA
blood test, which involves screening the blood for carcinoembryonic antigen, shares the
downside of FOBT, in that it is of limited utility in detecting colorectal cancer at an early
stage. Burdette at 125.

Once colon cancer has been diagnosed, treatment decisions are typically made in
reference to the stage of cancer progression. A number of techniques are employed to
stage the cancer (some of which are also used to screen for colon cancer), including
pathologic examination of resected colon, sigmoidoscopy, colonoscopy, and various
imaging techniques. AJCC Cancer Staging Handbook 84 (Irvin D. Fleming et al. eds., 5th
ed. 1998); Montgomery, R. C. and Ridge, J. A., Semin. Surg. Oncol. 15(3): 143-150
(1998). Moreover, chest films, liver functionality tests, and liver scans are employed to
determine the extent of metastasis. Fleming at 84. While computerized tomography and
magnetic resonance imaging are useful in staging colorectal cancer in its later stages, both
have unacceptably low staging accuracy for identifying early stages of the disease, due to
the difficulty that both methods have in (1) revealing the depth of bowel wall tumor
infiltration and (2) diagnosing malignant adenopathy. Thoeni, R. F., Radiol. Clin. N. Am.
35(2): 457-85 (1997). Rather, techniques such as transrectal ultrasound (TRUS) are
preferred in this context, although this technique is inaccurate with respect to detecting
small lymph nodes that may contain metastases. David Blumberg & Frank G. Opelka,
Neoadjuvant and Adjuvant Therapy for Adenocarcinoma of the Rectum, in Colon and
Rectal Cancer 316 (Peter S. Edelstein ed., 2000).

Several classification systems have been devised to stage the extent of colorectal
cancer, including the Dukes' system and the more detailed International Union against
Cancer-American Joint Committee on Cancer TNM staging system, which is considered
by many in the field to be a more useful staging system. Burdette at 126-27. The TNM
system, which is used for either clinical or pathological staging, is divided into four stages,
each of which evaluates the extent of cancer growth with respect to primary tumor (T),
regional lymph nodes (N), and distant metastasis (M). Fleming at 84-85. The system
focuses on the extent of tumor invasion into the intestinal wall, invasion of adjacent
structures, the number of regional lymph nodes that have been affected, and whether
distant metastasis has occurred. Fleming at 81.

Stage 0 is characterized by in situ carcinoma (Tis), in which the cancer cells are
located inside the glandular basement membrane (intraepithelial) or lamina propria
(intramucosal). In this stage, the cancer has not spread to the regional lymph nodes (N0),
and there is no distant metastasis (M0). In stage I, there is still no spread of the cancer to
the regional lymph nodes and no distant metastasis, but the tumor has invaded the
submucosa (T1) or has progressed further to invade the muscularis propria (T2). Stage II
also involves no spread of the cancer to the regional lymph nodes and no distant
metastasis, but the tumor has invaded the subserosa, or the nonperitonealized pericolic or
perirectal tissues (T3), or has progressed to invade other organs or structures, and/or has
perforated the visceral peritoneum (T4). Stage III is characterized by any of the T
substages, no distant metastasis, and either metastasis in 1 to 3 regional lymph nodes (N1)
or metastasis in four or more regional lymph nodes (N2). Lastly, stage IV involves any of
the T or N substages, as well as distant metastasis. Fleming at 84-85; Burdette at 127.
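
The stage grouping described in the preceding paragraph can be sketched as a small lookup. The following Python is an illustration only: the function names tnm_stage and n_category are hypothetical, the label M1 for "distant metastasis present" is an assumed notation (the text names only M0), and the sketch covers just the coarse grouping given above, not the full staging handbook.

```python
# Illustrative sketch of the stage grouping described in the text.
# Function names and the "M1" label are assumptions made for this example.

VALID_T = ("Tis", "T1", "T2", "T3", "T4")
VALID_N = ("N0", "N1", "N2")
VALID_M = ("M0", "M1")  # "M1" is assumed here to mean distant metastasis present


def n_category(positive_nodes: int) -> str:
    """Map a count of involved regional lymph nodes to N0, N1 (1 to 3) or N2 (4 or more)."""
    if positive_nodes < 0:
        raise ValueError("node count cannot be negative")
    if positive_nodes == 0:
        return "N0"
    return "N1" if positive_nodes <= 3 else "N2"


def tnm_stage(t: str, n: str, m: str) -> str:
    """Return the stage (0, I, II, III or IV) for a T, N, M combination."""
    if t not in VALID_T or n not in VALID_N or m not in VALID_M:
        raise ValueError(f"unrecognized category: {t!r}, {n!r}, {m!r}")
    if m == "M1":
        return "IV"   # any T, any N, distant metastasis
    if n != "N0":
        return "III"  # any T, regional nodes involved, no distant metastasis
    if t in ("T3", "T4"):
        return "II"   # deeper invasion, nodes clear
    if t in ("T1", "T2"):
        return "I"    # submucosa or muscularis propria, nodes clear
    return "0"        # Tis, N0, M0


if __name__ == "__main__":
    print(tnm_stage("T2", "N0", "M0"))
    print(tnm_stage("T3", n_category(5), "M0"))
```

Checking M first, then N, then T mirrors the order in which the text lets one category override the others: distant metastasis decides stage IV regardless of T and N, and nodal involvement decides stage III regardless of T.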

Currently, pathological staging of colon cancer is preferable over clinical staging
as pathological staging provides a more accurate prognosis. Pathological staging typically
involves examination of the resected colon section, along with surgical examination of the
abdominal cavity. Fleming at 84. Clinical staging would be a preferred method of staging
were it at least as accurate as pathological staging, as it does not depend on the invasive
procedures of its counterpart.

Turning to the treatment of colorectal cancer, surgical resection results in a cure for
roughly 50% of patients. Irradiation is used both preoperatively and postoperatively in
treating colorectal cancer. Chemotherapeutic agents, particularly 5-fluorouracil, are also
powerful weapons in treating colorectal cancer. Other agents include irinotecan,
floxuridine, cisplatin, levamisole, methotrexate, interferon-α, and leucovorin. Burdette at
125, 132-33. Nonetheless, thirty to forty percent of patients will develop a recurrence of
colon cancer following surgical resection, which in many patients is the ultimate cause of
death. Wayne De Vos, Follow-up After Treatment of Colon Cancer, in Colon and Rectal
Cancer 225 (Peter S. Edelstein ed., 2000). Accordingly, colon cancer patients must be
closely monitored to determine response to therapy and to detect persistent or recurrent
disease and metastasis.

The next few paragraphs describe some of the molecular bases of colon cancer. In
the case of FAP, the tumor suppressor gene APC (adenomatous polyposis coli),
chromosomally located at 5q21, has been either inactivated or deleted by mutation.
Alberts et al., Molecular Biology of the Cell 1288 (3d ed. 1994). The APC protein plays a
role in a number of functions, including cell adhesion, apoptosis, and repression of the
c-myc oncogene. N. R. Hall & R. D. Madoff, Genetics and the Polyp-Cancer Sequence, in
Colon and Rectal Cancer 8 (Peter S. Edelstein ed., 2000). Of those patients with
colorectal cancer who have normal APC genes, over 65% have such mutations in the
cancer cells but not in other tissues. Alberts et al., supra at 1288. In the case of HNPCC,
patients manifest abnormalities in the tumor suppressor gene HNPCC, but only about 15%
of tumors contain the mutated gene. Id. A host of other genes have also been implicated
in colorectal cancer, including the K-ras, N-ras, H-ras and c-myc oncogenes, and the
tumor suppressor genes DCC (deleted in colon carcinoma) and p53. Hall & Madoff, at
8-9; Alberts et al., at 1288.

Abnormalities in the Wg/Wnt signal transduction pathway are also associated with the
development of colorectal carcinoma. Taipale, J. and Beachy, P. A. Nature 411: 349-354
(2001). Wnt1 is a gene encoding a secreted protein, originally identified within mouse
mammary cancers by its insertion into the mouse mammary tumor virus (MMTV) gene. The
protein is homologous to the wingless (Wg) gene product of Drosophila, in which it
functions as an important factor for the determination of dorsal-ventral segmentation and
regulates the formation of fly imaginal discs. The Wg/Wnt pathway controls cell
proliferation, death and differentiation. Taipale (2001). There are at least 13 members in
the Wnt family. These proteins have been found expressed mainly in the central nervous
system (CNS) of vertebrates as well as in other tissues such as mammary and intestine.
The Wnt proteins are the ligands for a family of seven transmembrane domain receptors
related to the Frizzled gene product in Drosophila. Binding of Wnt to Frizzled stimulates
the activity of the downstream target, Dishevelled, which in turn inactivates glycogen
synthase kinase 3β (GSK3β). Taipale (2001). Usually, active GSK3β will form a complex
with the adenomatous polyposis coli (APC) protein and phosphorylate another complex
member, β-catenin. Once phosphorylated, β-catenin is directed to degradation through the
ubiquitin pathway. When GSK3β or APC activity is down-regulated, β-catenin accumulates
in the cytoplasm and binds to the T-cell factor or lymphocyte excitation factor (Tcf/Lef)
family of transcriptional factors. Binding of β-catenin to Tcf releases the transcriptional
repression and induces gene transcription. Among the genes regulated by β-catenin are a
transcriptional repressor Engrailed, a transforming growth factor-β (TGF-β) family
member Decapentaplegic, and the cytokine Hedgehog in Drosophila. β-Catenin is also
involved in regulating cell adhesion by binding to α-catenin and E-cadherin. On the other
hand, binding of β-catenin to these proteins controls the cytoplasmic β-catenin level and
its complexing with TCF. Taipale (2001). Growth factor stimulation and activation of
c-src or v-src also regulate the β-catenin level by phosphorylation of α-catenin and its
related protein, p120cas. When phosphorylated, these proteins decrease their binding to
E-cadherin and β-catenin, resulting in the accumulation of cytoplasmic β-catenin. Reynolds,
A.B. et al. Mol. Cell. Biol. 14: 8333-8342 (1994). In colon cancer, c-src enzymatic activity
has been shown to be increased to the level of v-src. Alteration of components in the
Wg/Wnt pathway promotes colorectal carcinoma development. The best known
modifications are to the APC gene. Nicola S et al. Hum. Mol. Genet. 10: 721-733 (2001).
This germline mutation causes the appearance of hundreds to thousands of adenomatous
polyps in the large bowel. It is the gene defect that accounts for the autosomal
dominantly inherited FAP and related syndromes. The molecular alterations that occur
in this pathway largely involve deletions of alleles of tumor-suppressor genes, such as
APC, p53 and Deleted in Colorectal Cancer (DCC), combined with mutational activation
of proto-oncogenes, especially c-Ki-ras. Aoki, T. et al. Human Mutat. 3: 342-346 (1994).
All of these lead to genomic instability in colorectal cancers.

Another source of genomic instability in colorectal cancer is the defect of DNA
mismatch repair (MMR) genes. Human homologues of the bacterial mutHLS complex
(hMSH2, hMLH1, hPMS1, hPMS2 and hMSH6), which is involved in DNA mismatch
repair in bacteria, have been shown to cause HNPCC (about 70-90% of HNPCC) when
mutated. Modrich, P. and Lahue, R. Ann. Rev. Biochem. 65: 101-133 (1996); and
Peltomaki, P. Hum. Mol. Genet. 10: 735-740 (2001). The inactivation of these proteins
leads to the accumulation of mutations and causes genetic instability that represents errors
in the accurate replication of the repetitive mono-, di-, tri- and tetra-nucleotide repeats,
which are scattered throughout the genome (microsatellite regions). Jass, J.R. et al. J.
Gastroenterol. Hepatol. 17: 17-26 (2002). As in classic FAP, mutational activation of
c-Ki-ras is also required for the promotion of microsatellite instability (MSI) in the
alternative HNPCC. Mutations in other proteins such as the tumor suppressor protein
phosphatase PTEN (Zhou, X.P. et al. Hum. Mol. Genet. 11: 445-450 (2002)), BAX
(Butler, L.M. Aust. N. Z. J. Surg. 69: 88-94 (1999)), Caspase-5 (Planck, M. Cancer Genet.
Cytogenet. 134: 46-54 (2002)), TGFβ-RII (Fallik, D. et al. Gastroenterol. Clin. Biol.
24: 917-22 (2000)) and IGFII-R (Giovannucci E. J. Nutr. 131: 3109S-20S (2001)) have
also been found in some colorectal tumors, possibly as the cause of the MMR defect.

Some tyrosine kinases have been shown to be up-regulated in colorectal tumor tissues or
cell lines such as HT29. Skoudy, A. et al. Biochem. J. 317 (Pt 1): 279-84 (1996). Focal
adhesion kinase (FAK) and its upstream kinases c-src and c-yes in colonic epithelial cells
may play an important role in the promotion of colorectal cancers through the extracellular
matrix (ECM) and integrin-mediated signaling pathways. Jessup, J.M. et al., The
molecular biology of colorectal carcinoma, in: The Molecular Basis of Human Cancer,
251-268 (Coleman W.B. and Tsongalis G.J. eds. 2002). The formation of c-src/FAK
complexes may coordinately deregulate VEGF expression and apoptosis inhibition.
Recent evidence suggests that, in a specific signal-transduction pathway for cell survival,
integrin engagement leads to FAK activation, which in turn activates PI-3 kinase and
akt. In turn, akt phosphorylates BAD and blocks apoptosis in epithelial cells. The
activation of c-src in colon cancer may induce VEGF expression through the hypoxia
pathway. Other genes that may be implicated in colorectal cancer include Cox enzymes
(Ota, S. et al. Aliment. Pharmacol. Ther. 16 (Suppl 2): 102-106 (2002)), estrogen
(al-Azzawi, F. and Wahab, M. Climacteric 5: 3-14 (2002)), peroxisome proliferator-activated
receptor-γ (PPAR-γ) (Gelman, et al. Cell. Mol. Life Sci. 55: 932-943 (1999)), IGF-I
(Giovannucci (2001)), thymine DNA glycosylase (TDG) (Hardeland, U. et al. Prog.
Nucleic Acid Res. Mol. Biol. 68: 235-253 (2001)) and EGF (Mendelsohn, J.
Endocrine-Related Cancer 8: 3-9 (2001)).

Gene deletion and mutation are not the only causes for development of colorectal
cancers. Epigenetic silencing by DNA methylation also accounts for the loss of function
of colorectal cancer suppressor genes. A strong association between MSI and CpG island
methylation has been well characterized in sporadic colorectal cancers with high MSI but
not in those of hereditary origin. In one experiment, DNA methylation of MLH1,
CDKN2A, MGMT, THBS1, RARB, APC, and p14ARF genes has been shown in 80%,
55%, 23%, 23%, 58%, 35%, and 50% of 40 sporadic colorectal cancers with high MSI,
respectively. Yamamoto, H. et al. Genes Chromosomes Cancer 33: 322-325 (2002); and
Kim, K.M. et al. Oncogene 21(35): 5441-9 (2002). Carcinogen metabolism enzymes
such as GST, NAT, CYP and MTHFR are also associated with an increased or decreased
colorectal cancer risk. Pistorius, S. et al. Kongressbd Dtsch Ges Chir Kongr 118: 820-824
(2001); and Potter, J.D. J. Natl. Cancer Inst. 91: 916-932 (1999).

WO 2004/050858
PCT/US2003/038808

From the foregoing, it is clear that procedures used for detecting, diagnosing,
monitoring, staging, prognosticating, and preventing the recurrence of colorectal cancer
are of critical importance to the outcome of the patient. Moreover, current procedures,
while helpful in each of these analyses, are limited by their specificity, sensitivity,
invasiveness, and/or their cost. As such, highly specific and sensitive procedures that
would operate by way of detecting novel markers in cells, tissues, or bodily fluids, with
minimal invasiveness and at a reasonable cost, would be highly desirable.

Accordingly, there is a great need for more sensitive and accurate methods for
predicting whether a person is likely to develop colorectal cancer, for diagnosing
colorectal cancer, for monitoring the progression of the disease, for staging the colorectal
cancer, for determining whether the colorectal cancer has metastasized, and for imaging
the colorectal cancer. Following accurate diagnosis, there is also a need for less invasive
and more effective treatment of colorectal cancer.

Growth and metastasis of solid tumors are also dependent on angiogenesis.
Folkman, J., 1986, Cancer Research, 46, 467-473; Folkman, J., 1989, Journal of the
National Cancer Institute, 82, 4-6. It has been shown, for example, that tumors which
enlarge to greater than 2 mm must obtain their own blood supply and do so by inducing
the growth of new capillary blood vessels. Once these new blood vessels become
embedded in the tumor, they provide a means for tumor cells to enter the circulation and
metastasize to distant sites such as liver, lung or bone. Weidner, N., et al., 1991, The New
England Journal of Medicine, 324(1), 1-8.

Angiogenesis, defined as the growth or sprouting of new blood vessels from
existing vessels, is a complex process that primarily occurs during embryonic
development. The process is distinct from vasculogenesis, in that the new endothelial cells
lining the vessel arise from proliferation of existing cells, rather than differentiating from
stem cells. The process is invasive and dependent upon proteolysis of the extracellular
matrix (ECM), migration of new endothelial cells, and synthesis of new matrix
components. Angiogenesis occurs during embryonic development of the circulatory
system; however, in adult humans, angiogenesis only occurs as a response to a
pathological condition (except during the reproductive cycle in women).

Under normal physiological conditions in adults, angiogenesis takes place only in
very restricted situations such as hair growth and wound healing. Auerbach, W. and
Auerbach, R., 1994, Pharmacol Ther. 63(3):265-311; Ribatti et al., 1991, Haematologica
76(4):311-20; Risau, 1997, Nature 386(6626):671-4. Angiogenesis progresses by a
stimulus which results in the formation of a migrating column of endothelial cells.
Proteolytic activity is focused at the advancing tip of this "vascular sprout", which breaks
down the ECM sufficiently to permit the column of cells to infiltrate and migrate. Behind
the advancing front, the endothelial cells differentiate and begin to adhere to each other,
thus forming a new basement membrane. The cells then cease proliferation and finally
define a lumen for the new arteriole or capillary.

Unregulated angiogenesis has gradually been recognized to be responsible for a
wide range of disorders, including, but not limited to, cancer, cardiovascular disease,
rheumatoid arthritis, psoriasis and diabetic retinopathy. Folkman, 1995, Nat Med 1(1):27-31;
Isner, 1999, Circulation 99(13):1653-5; Koch, 1998, Arthritis Rheum 41(6):951-62;
Walsh, 1999, Rheumatology (Oxford) 38(2):103-12; Ware and Simons, 1997, Nat Med
3(2):158-64.

Of particular interest is the observation that angiogenesis is required by solid
tumors for their growth and metastases. Folkman, 1986 supra; Folkman 1990, J Natl.
Cancer Inst., 82(1) 4-6; Folkman, 1992, Semin Cancer Biol 3(2):65-71; Zetter, 1998, Annu
Rev Med 49:407-24. A tumor usually begins as a single aberrant cell which can proliferate
only to a size of a few cubic millimeters due to the distance from available capillary beds,
and it can stay 'dormant' without further growth and dissemination for a long period of
time. Some tumor cells then switch to the angiogenic phenotype to activate endothelial
cells, which proliferate and mature into new capillary blood vessels. These newly formed
blood vessels not only allow for continued growth of the primary tumor, but also for the
dissemination and recolonization of metastatic tumor cells. The precise mechanisms that
control the angiogenic switch are not well understood, but it is believed that
neovascularization of tumor mass results from the net balance of a multitude of
angiogenesis stimulators and inhibitors. Folkman, 1995, supra.

One of the most potent angiogenesis inhibitors is endostatin, identified by O'Reilly
and Folkman. O'Reilly et al., 1997, Cell 88(2):277-85; O'Reilly et al., 1994, Cell
79(2):315-28. Its discovery was based on the phenomenon that certain primary tumors can
inhibit the growth of distant metastases. O'Reilly and Folkman hypothesized that a primary
tumor initiates angiogenesis by generating angiogenic stimulators in excess of inhibitors.
However, angiogenic inhibitors, by virtue of their longer half-life in the circulation, reach
the site of a secondary tumor in excess of the stimulators. The net result is the growth of
the primary tumor and inhibition of the secondary tumor. Endostatin is one of a growing
list of such angiogenesis inhibitors produced by primary tumors. It is a proteolytic
fragment of a larger protein: endostatin is a 20 kDa fragment of collagen XVIII (amino
acids H1132-K1315 in murine collagen XVIII). Endostatin has been shown to specifically
inhibit endothelial cell proliferation in vitro and block angiogenesis in vivo. More
importantly, administration of endostatin to tumor-bearing mice leads to significant tumor
regression, and no toxicity or drug resistance has been observed even after multiple
treatment cycles. Boehm et al., 1997, Nature 390(6658):404-407. The fact that endostatin
targets genetically stable endothelial cells and inhibits a variety of solid tumors makes it
a very attractive candidate for anticancer therapy. Fidler and Ellis, 1994, Cell
79(2):185-8; Gastl et al., 1997, Oncology 54(3):177-84; Hinsbergh et al., 1999, Ann Oncol
10 Suppl 4:60-3. In addition, angiogenesis inhibitors have been shown to be more effective
when combined with radiation and chemotherapeutic agents. Klement, 2000, J. Clin.
Invest. 105(8):R15-24; Browder, 2000, Cancer Res. 60(7):1878-86; Arap et al., 1998,
Science 279(5349):377-80; Mauceri et al., 1998, Nature 394(6690):287-91.

SUMMARY OF THE INVENTION

The present invention solves many needs in the art by providing nucleic acid
molecules, polypeptides and antibodies thereto, variants and derivatives of the nucleic
acids and polypeptides, and agonists and antagonists thereto that may be used to identify,
diagnose, monitor, stage, image and treat colon cancer and/or non-cancerous disease states
in colon; identify and monitor colon tissue; and identify and design agonists and
antagonists of polypeptides of the invention. The invention also provides gene therapy,
methods for producing transgenic animals and cells, and methods for producing
engineered colon tissue for treatment and research.

One aspect of the present invention relates to nucleic acid molecules that are
specific to colon cells, colon tissue and/or the colon organ. These colon specific nucleic
acids (CSNAs) may be a naturally occurring cDNA, genomic DNA, RNA, or a fragment
of one of these nucleic acids, or may be a non-naturally occurring nucleic acid molecule.
If the CSNA is genomic DNA, then the CSNA is a colon specific gene (CSG). If the
CSNA is RNA, then it is a colon specific transcript encoded by a CSG. Due to alternative
splicing and transcriptional modification, one CSG may encode multiple colon specific
RNAs. In a preferred embodiment, the nucleic acid molecule encodes a polypeptide that
is specific to colon. More preferred is a nucleic acid molecule that encodes a polypeptide
comprising an amino acid sequence of SEQ ID NO: 95-248. In another preferred
embodiment, the nucleic acid molecule comprises a nucleic acid sequence of SEQ ID NO:
1-94. For the CSNA sequences listed herein, DEX0450_001.nt.1 corresponds to SEQ ID
NO: 1. For sequences with multiple splice variants, the parent sequence
DEX0450_001.nt.1 will be followed by DEX0450_001.nt.2, etc. for each splice variant.
The sequences of the corresponding peptides are listed as DEX0450_001.aa.1, etc. For
the mapping of all of the nucleotides and peptides, see the table in the Example 1 section
below.
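
The identifier convention just described can be illustrated with a short parser. This is a sketch under stated assumptions: the names DexId, parse_dex_id and is_parent are hypothetical, and the reading of the fields (a DEX family prefix, a three-digit gene index, "nt" or "aa" for nucleotide or peptide, and a variant index where 1 denotes the parent sequence) is inferred from the examples in the text. The mapping to SEQ ID NOs comes from the table in Example 1 and is deliberately not reproduced here.

```python
import re
from typing import NamedTuple


class DexId(NamedTuple):
    """Parsed parts of an identifier such as DEX0450_001.nt.2 (hypothetical type)."""
    prefix: str   # e.g. "DEX0450"
    gene: int     # e.g. 1 for "_001"
    kind: str     # "nt" (nucleotide) or "aa" (peptide)
    variant: int  # 1 is assumed to be the parent sequence, 2+ the splice variants


# Field layout inferred from the examples in the text; an assumption.
_DEX_PATTERN = re.compile(r"^(DEX\d{4})_(\d{3})\.(nt|aa)\.(\d+)$")


def parse_dex_id(identifier: str) -> DexId:
    """Split an identifier into its parts, or raise ValueError if it does not match."""
    match = _DEX_PATTERN.match(identifier.strip())
    if match is None:
        raise ValueError(f"not a recognized identifier: {identifier!r}")
    prefix, gene, kind, variant = match.groups()
    return DexId(prefix, int(gene), kind, int(variant))


def is_parent(dex: DexId) -> bool:
    """True when the identifier names the parent sequence rather than a splice variant."""
    return dex.variant == 1


if __name__ == "__main__":
    for name in ("DEX0450_001.nt.1", "DEX0450_001.nt.2", "DEX0450_001.aa.1"):
        parsed = parse_dex_id(name)
        print(name, parsed, "parent" if is_parent(parsed) else "splice variant")
```

A strict pattern is used so that extraction-damaged identifiers (for example a lowercase "l" in place of the digit "1") are rejected rather than silently misread.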

This aspect of the present invention also relates to nucleic acid molecules that
selectively hybridize or exhibit substantial sequence similarity to nucleic acid molecules
encoding a Colon Specific Protein (CSP), or that selectively hybridize or exhibit
substantial sequence similarity to a CSNA. In one embodiment of the present invention,
the nucleic acid molecule comprises an allelic variant of a nucleic acid molecule encoding
a CSP, or an allelic variant of a CSNA. In another embodiment, the nucleic acid molecule
comprises a part of a nucleic acid sequence that encodes a CSP or a part of a nucleic acid
sequence of a CSNA.

In addition, this aspect of the present invention relates to a nucleic acid molecule
further comprising one or more expression control sequences controlling the transcription
and/or translation of all or a part of a CSNA or the transcription and/or translation of a
nucleic acid molecule that encodes all or a fragment of a CSP.

Another aspect of the present invention relates to vectors and/or host cells
comprising a nucleic acid molecule of this invention. In a preferred embodiment, the
nucleic acid molecule of the vector and/or host cell encodes all or a fragment of a CSP. In
another preferred embodiment, the nucleic acid molecule of the vector and/or host cell
comprises all or a part of a CSNA. Vectors and host cells of the present invention are
useful in the recombinant production of polypeptides, particularly CSPs of the present
invention.

Another aspect of the present invention relates to polypeptides encoded by a
nucleic acid molecule of this invention. The polypeptide may comprise either a fragment
or a full-length protein. In a preferred embodiment, the polypeptide is a CSP. However,
this aspect of the present invention also relates to mutant proteins (muteins) of CSPs,
fusion proteins of which a portion is a CSP, and proteins and polypeptides encoded by
allelic variants of a CSNA as provided herein.

A further aspect of the present invention is a novel splice variant which encodes an
amino acid sequence that provides a novel region to be targeted for the generation of
reagents that can be used in the detection and/or treatment of cancer. The novel amino
acid sequence may lead to a unique protein structure, protein subcellular localization,
biochemical processing or function. This information can be used to directly or indirectly
facilitate the generation of additional or novel therapeutics or diagnostics. The nucleotide
sequence in this novel splice variant can be used as a nucleic acid probe for the diagnosis
and/or treatment of cancer.

Another aspect of the present invention relates to antibodies and other binders that
specifically bind to a polypeptide of the instant invention. Accordingly, antibodies or
binders of the present invention specifically bind to CSPs, muteins, fusion proteins, and/or 
homologous proteins or polypeptides encoded by allelic variants of a CSNA as provided 
herein. 

Another aspect of the present invention relates to agonists and antagonists of the
nucleic acid molecules and polypeptides of this invention. The agonists and antagonists of
the instant invention may be used to treat colon cancer and non-cancerous disease states in 
colon and to produce engineered colon tissue. 

Another aspect of the present invention relates to methods for using the nucleic
acid molecules to detect or amplify nucleic acid molecules that have similar or identical
nucleic acid sequences compared to the nucleic acid molecules described herein. Such 
methods are useful in identifying, diagnosing, monitoring, staging, imaging and treating 
colon cancer and/or non-cancerous disease states in colon. Such methods are also useful 
in identifying and/or monitoring colon tissue. In addition, measurement of levels of one or
more of the nucleic acid molecules of this invention may be useful as a diagnostic as part
of a panel in combination with other known markers, particularly those described in the
colon cancer background section above.

Another aspect of the present invention relates to use of the nucleic acid molecules
of this invention in gene therapy, for producing transgenic animals and cells, and for
producing engineered colon tissue for treatment and research.

Another aspect of the present invention relates to methods for detecting 
polypeptides of this invention, preferably using antibodies thereto. Such methods are
useful to identify, diagnose, monitor, stage, image and treat colon cancer and
non-cancerous disease states in colon. In addition, measurement of levels of one or more of
the polypeptides of this invention may be useful to identify, diagnose, monitor, stage,
and/or image colon cancer in combination with other known markers, particularly those
described in the colon cancer background section above. The polypeptides of the present
invention can also be used to identify and/or monitor colon tissue, and to produce
engineered colon tissue.

Yet another aspect of the present invention relates to a computer readable means of 
storing the nucleic acid and amino acid sequences of the invention. The records of the 
computer readable means can be accessed for reading and displaying of sequences for
comparison, alignment and ordering of the sequences of the invention to other sequences.
In addition, the computer records regarding the nucleic acid and/or amino acid sequences 
and/or measurements of their levels may be used alone or in combination with other 
markers to diagnose colon related diseases. 

DETAILED DESCRIPTION OF THE INVENTION 
Definitions and General Techniques

Unless otherwise defined herein, scientific and technical terms used in connection 
with the present invention shall have the meanings that are commonly understood by those 
of ordinary skill in the art. Further, unless otherwise required by context, singular terms 
shall include pluralities and plural terms shall include the singular. Generally,
nomenclatures used in connection with, and techniques of, cell and tissue culture,
molecular biology, immunology, microbiology, genetics and protein and nucleic acid
chemistry and hybridization described herein are those well known and commonly used in
the art. The methods and techniques of the present invention are generally performed
according to conventional methods well known in the art and as described in various
general and more specific references that are cited and discussed throughout the present
specification unless otherwise indicated. See, e.g., Sambrook et al., Molecular Cloning: A
Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory Press (1989) and Sambrook et
al., Molecular Cloning: A Laboratory Manual, 3d ed., Cold Spring Harbor Press (2001);
Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing Associates
(1992, and Supplements to 2000); Ausubel et al., Short Protocols in Molecular Biology: A
Compendium of Methods from Current Protocols in Molecular Biology, 4th Ed., Wiley &
Sons (1999); Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor
Laboratory Press (1990); and Harlow and Lane, Using Antibodies: A Laboratory Manual,
Cold Spring Harbor Laboratory Press (1999).

Enzymatic reactions and purification techniques are performed according to 
manufacturer's specifications, as commonly accomplished in the art or as described
herein. The nomenclatures used in connection with, and the laboratory procedures and
techniques of, analytical chemistry, synthetic organic chemistry, and medicinal and 
pharmaceutical chemistry described herein are those well known and commonly used in 
the art. Standard techniques are used for chemical syntheses, chemical analyses, 
pharmaceutical preparation, formulation, and delivery, and treatment of patients. 

The following terms, unless otherwise indicated, shall be understood to have the
following meanings:

A "nucleic acid molecule" of this invention refers to a polymeric form of 
nucleotides and includes both sense and antisense strands of RNA, cDNA, genomic DNA, 
and synthetic forms and mixed polymers of the above. A nucleotide refers to a
ribonucleotide, deoxynucleotide or a modified form of either type of nucleotide. A
"nucleic acid molecule" as used herein is synonymous with "nucleic acid" and 
"polynucleotide." The term "nucleic acid molecule" usually refers to a molecule of at 
least 10 bases in length, unless otherwise specified. The term includes single- and
double-stranded forms of DNA. In addition, a polynucleotide may include either or both naturally
occurring and modified nucleotides linked together by naturally occurring and/or
non-naturally occurring nucleotide linkages. 

Nucleotides are represented by single letter symbols in nucleic acid molecule 
sequences. The following table lists symbols identifying nucleotides or groups of 
nucleotides which may occupy the symbol position on a nucleic acid molecule. See
Nomenclature Committee of the International Union of Biochemistry (NC-IUB),
Nomenclature for incompletely specified bases in nucleic acid sequences,
Recommendations 1984, Eur. J. Biochem. 150(1):1-5 (1985).



Symbol  Meaning                                 Group/Origin of Designation     Complementary Symbol
a       a                                       Adenine                         t/u
g       g                                       Guanine                         c
c       c                                       Cytosine                        g
t       t                                       Thymine                         a
u       u                                       Uracil                          a
r       g or a                                  puRine                          y
y       t/u or c                                pYrimidine                      r
m       a or c                                  aMino                           k
k       g or t/u                                Keto                            m
s       g or c                                  Strong interactions 3H-bonds    s
w       a or t/u                                Weak interactions 2H-bonds      w
b       g or c or t/u                           not a                           v
d       a or g or t/u                           not c                           h
h       a or c or t/u                           not g                           d
v       a or g or c                             not t, not u                    b
n       a or g or c or t/u, unknown, or other   aNy                             n
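
By way of illustration only, the symbol table can be held as a small lookup structure. The following Python sketch is not part of the disclosure: the names IUPAC_BASES, PARTNER, complement_symbol, reverse_complement and matches are hypothetical, t is used to stand for t/u, and the complementary symbol is computed from the set of bases each symbol represents (so that, for example, s and w each map to themselves) rather than being copied from a table.

```python
# Illustrative sketch; all names are hypothetical and "t" stands for t/u.
IUPAC_BASES = {
    "a": "a", "c": "c", "g": "g", "t": "t", "u": "t",
    "r": "ag", "y": "ct", "m": "ac", "k": "gt",
    "s": "cg", "w": "at",
    "b": "cgt", "d": "agt", "h": "act", "v": "acg",
    "n": "acgt",
}

# Watson-Crick partners of the four unambiguous bases.
PARTNER = {"a": "t", "c": "g", "g": "c", "t": "a"}


def complement_symbol(symbol):
    """Return the symbol whose base set pairs with that of `symbol`."""
    bases = IUPAC_BASES[symbol.lower()]
    target = frozenset(PARTNER[b] for b in bases)
    for candidate, members in IUPAC_BASES.items():
        # Skip "u" so that the complement of "a" is reported as "t".
        if candidate != "u" and frozenset(members) == target:
            return candidate
    raise ValueError(f"no complementary symbol for {symbol!r}")


def reverse_complement(sequence):
    """Complement each symbol and reverse the order (opposite strand, 5' to 3')."""
    return "".join(complement_symbol(s) for s in reversed(sequence))


def matches(pattern, sequence):
    """True if every position of `sequence` is permitted by `pattern`."""
    if len(pattern) != len(sequence):
        return False
    return all(
        set(IUPAC_BASES[s.lower()]) <= set(IUPAC_BASES[p.lower()])
        for p, s in zip(pattern, sequence)
    )
```

For example, matches("acry", "acgt") is true because g is a purine and t is a pyrimidine, whereas matches("acry", "acct") is false.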

The nucleic acid molecules may be modified chemically or biochemically or may
contain non-natural or derivatized nucleotide bases, as will be readily appreciated by those
of skill in the art. Such modifications include, for example, labels, methylation,
substitution of one or more of the naturally occurring nucleotides with an analog,
internucleotide modifications such as uncharged linkages (e.g., methyl phosphonates,
phosphotriesters, phosphoramidates, carbamates, etc.), charged linkages (e.g.,
phosphorothioates, phosphorodithioates, etc.), pendent moieties (e.g., polypeptides),
intercalators (e.g., acridine, psoralen, etc.), chelators, alkylators, and modified linkages
(e.g., alpha anomeric nucleic acids, etc.). The term "nucleic acid molecule" also includes
any topological conformation, including single-stranded, double-stranded, partially
duplexed, triplexed, hairpinned, circular and padlocked conformations. Also included are
synthetic molecules that mimic polynucleotides in their ability to bind to a designated
sequence via hydrogen bonding and other chemical interactions. Such molecules are
known in the art and include, for example, those in which peptide linkages substitute for
phosphate linkages in the backbone of the molecule.

A "gene" is defined as a nucleic acid molecule that comprises a nucleic acid
sequence that encodes a polypeptide and the expression control sequences that surround
the nucleic acid sequence that encodes the polypeptide. For instance, a gene may
comprise a promoter, one or more enhancers, a nucleic acid sequence that encodes a
polypeptide, downstream regulatory sequences and, possibly, other nucleic acid sequences
involved in regulation of the expression of an RNA. As is well known in the art, 
eukaryotic genes usually contain both exons and introns. The term "exon" refers to a 
nucleic acid sequence found in genomic DNA that is bioinformatically predicted and/or 
experimentally confirmed to contribute contiguous sequence to a mature mRNA
transcript. The term "intron" refers to a nucleic acid sequence found in genomic DNA that
is predicted and/or confirmed to not contribute to a mature mRNA transcript, but rather to 
be "spliced out" during processing of the transcript. 

A nucleic acid molecule or polypeptide is "derived" from a particular species if the 
nucleic acid molecule or polypeptide has been isolated from the particular species, or if the
nucleic acid molecule or polypeptide is homologous to a nucleic acid molecule or
polypeptide isolated from a particular species. 

An "isolated" or "substantially pure" nucleic acid or polynucleotide (e.g., an RNA, 
DNA or a mixed polymer) is one which is substantially separated from other cellular 
components that naturally accompany the native polynucleotide in its natural host cell,
e.g., ribosomes, polymerases, or genomic sequences with which it is naturally associated.
The term embraces a nucleic acid or polynucleotide that (1) has been removed from its 
naturally occurring environment, (2) is not associated with all or a portion of a 
polynucleotide in which the "isolated polynucleotide" is found in nature, (3) is operatively 
linked to a polynucleotide which it is not linked to in nature, (4) does not occur in nature
as part of a larger sequence or (5) includes nucleotides or internucleoside bonds that are
not found in nature. The term "isolated" or "substantially pure" also can be used in 
reference to recombinant or cloned DNA isolates, chemically synthesized polynucleotide 
analogs, or polynucleotide analogs that are biologically synthesized by heterologous 
systems. The term "isolated nucleic acid molecule" includes nucleic acid molecules that
are integrated into a host cell chromosome at a heterologous site, recombinant fusions of a
native fragment to a heterologous sequence, recombinant vectors present as episomes or as 
integrated into a host cell chromosome.

A "part" of a nucleic acid molecule refers to a nucleic acid molecule that
comprises a partial contiguous sequence of at least 10 bases of the reference nucleic acid
molecule. Preferably, a part comprises at least 15 to 20 bases of a reference nucleic acid
molecule. In theory, a nucleic acid sequence of 17 nucleotides is of sufficient length to
occur at random less frequently than once in the three gigabase human genome, and thus
provides a nucleic acid probe that can uniquely identify the reference sequence in a
nucleic acid mixture of genomic complexity. A preferred part is one that comprises a
nucleic acid sequence that can encode at least 6 contiguous amino acids
(fragments of at least 18 nucleotides) because such parts are useful in directing the expression or
synthesis of peptides that are useful in mapping the epitopes of the polypeptide encoded
by the reference nucleic acid. See, e.g., Geysen et al., Proc. Natl. Acad. Sci. USA
81:3998-4002 (1984); and U.S. Patent Nos. 4,708,871 and 5,595,915, the disclosures of
which are incorporated herein by reference in their entireties. A part may also comprise at
least 25, 30, 35 or 40 nucleotides of a reference nucleic acid molecule, or at least 50, 60,
70, 80, 90, 100, 150, 200, 250, 300, 350, 400 or 500 nucleotides of a reference nucleic
acid molecule. A part of a nucleic acid molecule may comprise no other nucleic acid 
sequences. Alternatively, a part of a nucleic acid may comprise other nucleic acid 
sequences from other nucleic acid molecules. 
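
The 17-nucleotide figure above can be checked with a short calculation. The following Python sketch is illustrative only and rests on simplifying assumptions that are not stated in the text: the four bases are taken to be equally frequent and independent at every position, and the function names and the strands parameter are hypothetical. Under that model a specific 17-mer is expected roughly 0.17 times on one strand of a three gigabase genome, and 17 is the shortest length expected fewer than once when both strands are counted.

```python
def expected_random_hits(k, genome_size=3_000_000_000, strands=1):
    """Expected chance occurrences of one specific k-mer in a random genome.

    Assumes equiprobable, independent bases (a simplification).
    """
    positions = strands * (genome_size - k + 1)
    return positions / 4 ** k


def shortest_unique_length(genome_size=3_000_000_000, strands=1):
    """Smallest k whose expected number of chance occurrences is below one."""
    k = 1
    while expected_random_hits(k, genome_size, strands) >= 1:
        k += 1
    return k


if __name__ == "__main__":
    print(expected_random_hits(17))           # single strand
    print(shortest_unique_length(strands=2))  # both strands considered
```

Real genomes contain repeats and biased base composition, so the model gives only a lower bound on the probe length needed in practice.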

The term "oligonucleotide" refers to a nucleic acid molecule generally comprising
a length of 200 bases or fewer. The term often refers to single-stranded
deoxyribonucleotides, but it can refer as well to single- or double-stranded ribonucleotides,
RNA:DNA hybrids and double-stranded DNAs, among others. Preferably,
oligonucleotides are 10 to 60 bases in length and most preferably 12, 13, 14, 15, 16, 17,
18, 19 or 20 bases in length. Other preferred oligonucleotides are 25, 30, 35, 40, 45, 50,
55 or 60 bases in length. Oligonucleotides may be single-stranded, e.g. for use as probes
or primers, or may be double-stranded, e.g. for use in the construction of a mutant gene. 
Oligonucleotides of the invention can be either sense or antisense oligonucleotides. An 
oligonucleotide can be derivatized or modified as discussed above for nucleic acid 
molecules. 

Oligonucleotides, such as single-stranded DNA probe oligonucleotides, often are
synthesized by chemical methods, such as those implemented on automated
oligonucleotide synthesizers. However, oligonucleotides can be made by a variety of
other methods, including in vitro recombinant DNA-mediated techniques and by
expression of DNAs in cells and organisms. Initially, chemically synthesized DNAs
typically are obtained without a 5' phosphate. The 5' ends of such oligonucleotides are
not substrates for phosphodiester bond formation by ligation reactions that employ DNA
ligases typically used to form recombinant DNA molecules. Where ligation of such
oligonucleotides is desired, a phosphate can be added by standard techniques, such as
those that employ a kinase and ATP. The 3' end of a chemically synthesized
oligonucleotide generally has a free hydroxyl group and, in the presence of a ligase, such
as T4 DNA ligase, readily will form a phosphodiester bond with a 5' phosphate of another
polynucleotide, such as another oligonucleotide. As is well known, this reaction can be
prevented selectively, where desired, by removing the 5' phosphates of the other
polynucleotide(s) prior to ligation.

The term "naturally occurring nucleotide" referred to herein includes naturally 
occurring deoxyribonucleotides and ribonucleotides. The term "modified nucleotides" 
referred to herein includes nucleotides with modified or substituted sugar groups and the
like. The term "nucleotide linkages" referred to herein includes nucleotide linkages such
as phosphorothioate, phosphorodithioate, phosphoroselenoate, phosphorodiselenoate,
phosphoroanilothioate, phosphoraniladate, phosphoroamidate, and the like. See, e.g.,
LaPlanche et al., Nucl. Acids Res. 14:9081-9093 (1986); Stein et al., Nucl. Acids Res.
16:3209-3221 (1988); Zon et al., Anti-Cancer Drug Design 6:539-568 (1991); Zon et al.,
in Eckstein (ed.) Oligonucleotides and Analogues: A Practical Approach, pp. 87-108,
Oxford University Press (1991); Uhlmann and Peyman, Chemical Reviews 90:543 (1990);
and U.S. Patent No. 5,151,510, the disclosure of which is hereby incorporated by
reference in its entirety.

Unless specified otherwise, the left hand end of a polynucleotide sequence in sense
orientation is the 5' end and the right hand end of the sequence is the 3' end. In addition,
the left hand direction of a polynucleotide sequence in sense orientation is referred to as
the 5' direction, while the right hand direction of the polynucleotide sequence is referred
to as the 3' direction. Further, unless otherwise indicated, each nucleotide sequence is set
forth herein as a sequence of deoxyribonucleotides. It is intended, however, that the given
sequence be interpreted as would be appropriate to the polynucleotide composition: for
example, if the isolated nucleic acid is composed of RNA, the given sequence intends
ribonucleotides, with uridine substituted for thymidine.

The term "allelic variant" refers to one of two or more alternative naturally
occurring forms of a gene, wherein each gene possesses a unique nucleotide sequence. In 
a preferred embodiment, different alleles of a given gene have similar or identical 
biological properties. 

The term "percent sequence identity" in the context of nucleic acid sequences
refers to the percentage of residues in two sequences which are the same when aligned for maximum
correspondence. The length of sequence identity comparison may be over a stretch of at
least about nine nucleotides, usually at least about 20 nucleotides, more usually at least
about 24 nucleotides, typically at least about 28 nucleotides, more typically at least about
32 nucleotides, and preferably at least about 36 or more nucleotides. There are a number
of different algorithms known in the art which can be used to measure nucleotide sequence
identity. For instance, polynucleotide sequences can be compared using FASTA, Gap or
Bestfit, which are programs in Wisconsin Package Version 10.0, Genetics Computer
Group (GCG), Madison, Wisconsin. FASTA, which includes, e.g., the programs FASTA2
and FASTA3, provides alignments and percent sequence identity of the regions of the best
overlap between the query and search sequences (Pearson, Methods Enzymol. 183: 63-98
(1990); Pearson, Methods Mol. Biol. 132: 185-219 (2000); Pearson, Methods Enzymol.
266: 227-258 (1996); Pearson, J. Mol. Biol. 276: 71-84 (1998)). Unless otherwise
specified, default parameters for a particular program or algorithm are used. For instance,
percent sequence identity between nucleic acid sequences can be determined using
FASTA with its default parameters (a word size of 6 and the NOPAM factor for the
scoring matrix) or using Gap with its default parameters as provided in GCG Version 6.1.
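
As a simple illustration of the quantity being measured, the following Python sketch computes percent identity for two sequences that have already been aligned (gaps shown as "-"). It is not the FASTA, Gap or Bestfit algorithm and does not perform the alignment itself; the function name and the choice of denominator (all aligned columns, excluding columns that are gaps in both sequences) are assumptions made for the example, and the named programs may count differently.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of aligned columns in which both sequences carry the same residue.

    Inputs must be pre-aligned strings of equal length; "-" denotes a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns that are gaps in both sequences.
    columns = [
        (x, y)
        for x, y in zip(aligned_a.lower(), aligned_b.lower())
        if not (x == "-" and y == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    identical = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * identical / len(columns)


if __name__ == "__main__":
    print(percent_identity("acgtacgt", "acgtacga"))  # one mismatch in eight
    print(percent_identity("acg-t", "acgtt"))        # one gap in five columns
```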

A reference to a nucleic acid sequence encompasses its complement unless 
otherwise specified. Thus, a reference to a nucleic acid molecule having a particular
sequence should be understood to encompass its complementary strand, with its
complementary sequence. The complementary strand is also useful, e.g., for antisense
therapy, double-stranded RNA (dsRNA) inhibition (RNAi), combination of triplex and 
antisense, hybridization probes and PCR primers. 

In the molecular biology art, researchers use the terms "percent sequence identity",
"percent sequence similarity" and "percent sequence homology" interchangeably. In this
application, these terms shall have the same meaning with respect to nucleic acid
sequences only.

The term "substantial similarity" or "substantial sequence similarity," when
referring to a nucleic acid or fragment thereof, indicates that, when optimally aligned with
appropriate nucleotide insertions or deletions with another nucleic acid (or its
complementary strand), there is nucleotide sequence identity in at least about 50%, more
preferably 60% of the nucleotide bases, usually at least about 70%, more usually at least
about 80%, preferably at least about 90%, and more preferably at least about 95-98% of
the nucleotide bases, as measured by any well known algorithm of sequence identity, such
as FASTA, BLAST or Gap, as discussed above.

Alternatively, substantial similarity exists between a first and second nucleic acid
sequence when the first nucleic acid sequence or fragment thereof hybridizes to an
antisense strand of the second nucleic acid, under selective hybridization conditions.
Typically, selective hybridization will occur between the first nucleic acid sequence and
an antisense strand of the second nucleic acid sequence when there is at least about 55%
sequence identity between the first and second nucleic acid sequences (preferably at least
about 65%, more preferably at least about 75%, and most preferably at least about 90%)
over a stretch of at least about 14 nucleotides, more preferably at least 17 nucleotides,
even more preferably at least 20, 25, 30, 35, 40, 50, 60, 70, 80, 90 or 100 nucleotides.

Nucleic acid hybridization will be affected by such conditions as salt
concentration, temperature, solvents, the base composition of the hybridizing species,
length of the complementary regions, and the number of nucleotide base mismatches
between the hybridizing nucleic acids, as will be readily appreciated by those skilled in the
art. "Stringent hybridization conditions" and "stringent wash conditions" in the context
of nucleic acid hybridization experiments depend upon a number of different physical
parameters. The most important parameters include temperature of hybridization, base
composition of the nucleic acids, salt concentration and length of the nucleic acid. One
having ordinary skill in the art knows how to vary these parameters to achieve a particular
stringency of hybridization. In general, "stringent hybridization" is performed at about
25°C below the thermal melting point (T_m) for the specific DNA hybrid under a particular
set of conditions. "Stringent washing" is performed at temperatures about 5°C lower than
the T_m for the specific DNA hybrid under a particular set of conditions. The T_m is the
temperature at which 50% of the target sequence hybridizes to a perfectly matched probe.
See Sambrook (1989), supra, p. 9.51.

The T_m for a particular DNA-DNA hybrid can be estimated by the formula:

$$T_m = 81.5\,^{\circ}\mathrm{C} + 16.6\,\log_{10}[\mathrm{Na}^{+}] + 0.41\,(\text{fraction G + C}) - 0.63\,(\%\ \text{formamide}) - \frac{600}{l}$$

where l is the length of the hybrid in base pairs.

The T_m for a particular RNA-RNA hybrid can be estimated by the formula:

$$T_m = 79.8\,^{\circ}\mathrm{C} + 18.5\,\log_{10}[\mathrm{Na}^{+}] + 0.58\,(\text{fraction G + C}) + 11.8\,(\text{fraction G + C})^{2} - 0.35\,(\%\ \text{formamide}) - \frac{820}{l}$$

The T_m for a particular RNA-DNA hybrid can be estimated by the formula:

$$T_m = 79.8\,^{\circ}\mathrm{C} + 18.5\,\log_{10}[\mathrm{Na}^{+}] + 0.58\,(\text{fraction G + C}) + 11.8\,(\text{fraction G + C})^{2} - 0.50\,(\%\ \text{formamide}) - \frac{820}{l}$$
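
The DNA-DNA estimate can be evaluated numerically as in the following Python sketch, which is offered only as an illustration. The function name and argument names are hypothetical. One assumption should be noted in particular: the text writes the G + C term as a "fraction", whereas this sketch applies the 0.41 coefficient to G + C content expressed as a percentage (0 to 100), which is how this formula is commonly stated; that reading should be checked against the cited reference before the values are relied upon.

```python
import math


def tm_dna_dna(na_molar, gc_percent, formamide_percent, length_bp):
    """Estimated melting temperature (°C) of a DNA-DNA hybrid.

    Assumption: `gc_percent` is G + C content on a 0-100 scale.
    """
    if na_molar <= 0 or length_bp <= 0:
        raise ValueError("salt concentration and length must be positive")
    return (
        81.5
        + 16.6 * math.log10(na_molar)
        + 0.41 * gc_percent
        - 0.63 * formamide_percent
        - 600.0 / length_bp
    )


if __name__ == "__main__":
    # 1 M Na+, 50% G + C, 600 bp hybrid, without and with 50% formamide.
    print(tm_dna_dna(1.0, 50.0, 0.0, 600))
    print(tm_dna_dna(1.0, 50.0, 50.0, 600))
```

Comparing the two calls shows the effect of the formamide term: each 1% of formamide lowers the estimate by 0.63°C.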

In general, the T_m decreases by 1-1.5°C for each 1% of mismatch between two
nucleic acid sequences. Thus, one having ordinary skill in the art can alter hybridization
and/or washing conditions to obtain sequences that have higher or lower degrees of
sequence identity to the target nucleic acid. For instance, to obtain hybridizing nucleic
acids that contain up to 10% mismatch from the target nucleic acid sequence, 10-15°C
would be subtracted from the calculated T_m of a perfectly matched hybrid, and then the
hybridization and washing temperatures adjusted accordingly. Probe sequences may also
hybridize specifically to duplex DNA under certain conditions to form triplex or other 
higher order DNA complexes. The preparation of such probes and suitable hybridization 
conditions are well known in the art. 
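
The mismatch adjustment and the stringency offsets described above (about 25°C below the melting temperature for hybridization and about 5°C below it for washing) can be combined in a short calculation. The following Python sketch is illustrative only; the function names are hypothetical, and the melting temperature of the perfectly matched hybrid is taken as an input rather than computed.

```python
def tm_with_mismatch(tm_perfect, mismatch_percent):
    """Return (lowest, highest) estimated melting temperature in °C.

    Applies the 1-1.5°C decrease per 1% mismatch stated in the text.
    """
    return (
        tm_perfect - 1.5 * mismatch_percent,
        tm_perfect - 1.0 * mismatch_percent,
    )


def stringent_temperatures(tm):
    """Approximate stringent hybridization and wash temperatures in °C."""
    return {"hybridization": tm - 25.0, "wash": tm - 5.0}


if __name__ == "__main__":
    low, high = tm_with_mismatch(90.0, 10)  # allow up to 10% mismatch
    print(low, high)
    print(stringent_temperatures(low))
```

Using the lower end of the adjusted range is the more permissive choice, since it admits hybrids with the full stated degree of mismatch.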

An example of stringent hybridization conditions for hybridization of
complementary nucleic acid sequences having more than 100 complementary residues on
a filter in a Southern or Northern blot or for screening a library is 50% formamide/6X SSC
at 42°C for at least ten hours and preferably overnight (approximately 16 hours). Another
example of stringent hybridization conditions is 6X SSC at 68°C without formamide for at
least ten hours and preferably overnight. An example of moderate stringency
hybridization conditions is 6X SSC at 55°C without formamide for at least ten hours and
preferably overnight. An example of low stringency hybridization conditions for
hybridization of complementary nucleic acid sequences having more than 100
complementary residues on a filter in a Southern or Northern blot or for screening a library
is 6X SSC at 42°C for at least ten hours. Hybridization conditions to identify nucleic acid
sequences that are similar but not identical can be identified by experimentally changing
the hybridization temperature from 68°C to 42°C while keeping the salt concentration
constant (6X SSC), or keeping the hybridization temperature and salt concentration
constant (e.g. 42°C and 6X SSC) and varying the formamide concentration from 50% to
0%. Hybridization buffers may also include blocking agents to lower background. These
agents are well known in the art. See Sambrook et al. (1989), supra, pages 8.46 and
9.46-9.58. See also Ausubel (1992), supra, Ausubel (1999), supra, and Sambrook (2001),
supra.

Wash conditions also can be altered to change stringency conditions. An example
of stringent wash conditions is a 0.2x SSC wash at 65°C for 15 minutes (see Sambrook
(1989), supra, for SSC buffer). Often the high stringency wash is preceded by a low
stringency wash to remove excess probe. An exemplary medium stringency wash for
duplex DNA of more than 100 base pairs is 1x SSC at 45°C for 15 minutes. An
exemplary low stringency wash for such a duplex is 4x SSC at 40°C for 15 minutes. In
general, a signal-to-noise ratio at least 2x that observed for an unrelated probe in
the particular hybridization assay indicates detection of a specific hybridization.

As defined herein, nucleic acids that do not hybridize to each other under stringent 
conditions are still substantially similar to one another if they encode polypeptides that are
substantially identical to each other. This occurs, for example, when a nucleic acid is
created synthetically or recombinantly using a high codon degeneracy as permitted by the 
redundancy of the genetic code. 

Hybridization conditions for nucleic acid molecules that are shorter than 100
nucleotides in length (e.g., for oligonucleotide probes) may be calculated by the formula:

$$T_m = 81.5\,^{\circ}\mathrm{C} + 16.6\,\log_{10}[\mathrm{Na}^{+}] + 0.41\,(\text{fraction G + C}) - \frac{600}{N}$$

wherein N is chain length and the [Na+] is 1 M or less. See Sambrook (1989), supra, p. 11.46. For
hybridization of probes shorter than 100 nucleotides, hybridization is usually performed
under stringent conditions (5-10°C below the T_m) using high concentrations (0.1-1.0
pmol/ml) of probe. Id. at p. 11.45. Determination of hybridization using mismatched
probes, pools of degenerate probes or "guessmers," as well as hybridization solutions and
methods for empirically determining hybridization conditions are well known in the art.
See, e.g., Ausubel (1999), supra; Sambrook (1989), supra, pp. 11.45-11.57.
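
For a short probe the same estimate can be computed directly from the probe sequence, as in the following illustrative Python sketch. The function names are hypothetical. As an assumption of the sketch, G + C content is expressed as a percentage (0 to 100) when multiplied by 0.41, although the text writes "fraction"; this reading should be verified against the cited reference. The hybridization window returned is the 5-10°C below the melting temperature mentioned above.

```python
import math


def gc_percent(sequence):
    """G + C content of a sequence as a percentage (0-100)."""
    seq = sequence.lower()
    if not seq:
        raise ValueError("sequence must not be empty")
    return 100.0 * (seq.count("g") + seq.count("c")) / len(seq)


def oligo_tm(sequence, na_molar=1.0):
    """Estimated melting temperature (°C) for a probe shorter than 100 nt."""
    if not 0 < na_molar <= 1.0:
        raise ValueError("the formula is stated for [Na+] of 1 M or less")
    return (
        81.5
        + 16.6 * math.log10(na_molar)
        + 0.41 * gc_percent(sequence)
        - 600.0 / len(sequence)
    )


def hybridization_window(tm):
    """Temperatures 10°C and 5°C below the melting temperature."""
    return (tm - 10.0, tm - 5.0)


if __name__ == "__main__":
    probe = "acgt" * 5  # hypothetical 20-mer with 50% G + C
    tm = oligo_tm(probe, na_molar=1.0)
    print(tm, hybridization_window(tm))
```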

The term "digestion" or "digestion of DNA" refers to catalytic cleavage of the
DNA with a restriction enzyme that acts only at certain sequences in the DNA. The
various restriction enzymes referred to herein are commercially available and their
reaction conditions, cofactors and other requirements for use are known and routine to the
skilled artisan. For analytical purposes, typically, 1 µg of plasmid or DNA fragment is
digested with about 2 units of enzyme in about 20 µl of reaction buffer. For the purpose of
isolating DNA fragments for plasmid construction, typically 5 to 50 µg of DNA are
digested with 20 to 250 units of enzyme in proportionately larger volumes. Appropriate
buffers and substrate amounts for particular restriction enzymes are described in standard
laboratory manuals, such as those referenced below, and are specified by commercial
suppliers. Incubation times of about 1 hour at 37°C are ordinarily used, but conditions
may vary in accordance with standard procedures, the supplier's instructions and the 
particulars of the reaction. After digestion, reactions may be analyzed, and fragments may 
be purified by electrophoresis through an agarose or polyacrylamide gel, using well 
known methods that are routine for those skilled in the art. 
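
The sequence specificity of digestion can be illustrated in silico. The following Python sketch is a simplified model and not part of the disclosure: it considers only the top strand of a linear molecule, the function name and parameters are hypothetical, and the example recognition sequence gaattc with a cut after its first base (corresponding to the enzyme EcoRI, which is not named in the text) is chosen purely for illustration.

```python
def digest_linear(sequence, site, cut_offset):
    """Split a linear top-strand sequence at every occurrence of `site`.

    `cut_offset` is the number of bases of the site that stay on the
    left-hand fragment (e.g. 1 for a cut after the first base of the site).
    """
    seq = sequence.lower()
    site = site.lower()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    fragments = digest_linear("aaagaattcttt", "gaattc", 1)
    print(fragments)
    print([len(f) for f in fragments])  # sizes as would be resolved on a gel
```

The fragment lengths are the quantities that would be compared against bands after electrophoresis through an agarose or polyacrylamide gel.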

The term "ligation" refers to the process of forming phosphodiester bonds between
two or more polynucleotides, which most often are double-stranded DNAs. Techniques
for ligation are well known to the art and protocols for ligation are described in standard 
laboratory manuals and references, such as, e.g., Sambrook (1989), supra. 

Genome-derived "single exon probes" are probes that comprise at least part of an
exon ("reference exon") and can hybridize detectably under high stringency conditions to
transcript-derived nucleic acids that include the reference exon but do not hybridize
detectably under high stringency conditions to nucleic acids that lack the reference exon.
Single exon probes typically further comprise, contiguous to a first end of the exon
portion, a first intronic and/or intergenic sequence that is identically contiguous to the
exon in the genome, and may contain a second intronic and/or intergenic sequence that is
identically contiguous to the exon in the genome. The minimum length of genome-derived
single exon probes is defined by the requirement that the exonic portion be of
sufficient length to hybridize under high stringency conditions to transcript-derived
nucleic acids, as discussed above. The maximum length of genome-derived single exon
probes is defined by the requirement that the probes contain portions of no more than one
exon. The single exon probes may contain priming sequences not found in contiguity with
the rest of the probe sequence in the genome, which priming sequences are useful for PCR
and other amplification-based technologies. In another aspect, the invention is directed to
single exon probes based on the CSNAs disclosed herein.

In one embodiment, the term "microarray" refers to a "nucleic acid microarray"
having a substrate-bound plurality of nucleic acids, hybridization to each of the plurality
of bound nucleic acids being separately detectable. The substrate can be solid or porous,
planar or non-planar, unitary or distributed. Nucleic acid microarrays include all the
devices so called in Schena (ed.), DNA Microarrays: A Practical Approach (Practical
Approach Series), Oxford University Press (1999); Nature Genet. 21(1)(suppl.):1-60
(1999); Schena (ed.), Microarray Biochip: Tools and Technology, Eaton Publishing
Company/BioTechniques Books Division (2000). Additionally, these nucleic acid
microarrays include a substrate-bound plurality of nucleic acids in which the plurality of
nucleic acids are disposed on a plurality of beads, rather than on a unitary planar substrate,
as is described, inter alia, in Brenner et al., Proc. Natl. Acad. Sci. USA 97(4):1665-1670
(2000). Examples of nucleic acid microarrays may be found in U.S. Patent Nos.
6,391,623, 6,383,754, 6,383,749, 6,380,377, 6,379,897, 6,376,191, 6,372,431, 6,351,712,
6,344,316, 6,316,193, 6,312,906, 6,309,828, 6,309,824, 6,306,643, 6,300,063, 6,287,850,
6,284,497, 6,284,465, 6,280,954, 6,262,216, 6,251,601, 6,245,518, 6,263,287, 6,251,601,
6,238,866, 6,228,575, 6,214,587, 6,203,989, 6,171,797, 6,103,474, 6,083,726, 6,054,274,
6,040,138, 6,083,726, 6,004,755, 6,001,309, 5,958,342, 5,952,180, 5,936,731, 5,843,655,
5,814,454, 5,837,196, 5,436,327, 5,412,087, and 5,405,783, the disclosures of which are
incorporated herein by reference in their entireties.

In an alternative embodiment, a "microarray" may also refer to a "peptide 
microarray" or "protein microarray" having a substrate-bound collection or plurality of 
polypeptides, the binding to each of the plurality of bound polypeptides being separately 
detectable. Alternatively, the peptide microarray may have a plurality of binders,
including but not limited to monoclonal antibodies, polyclonal antibodies, phage display
binders, yeast two-hybrid binders, and aptamers, which can specifically detect the binding of
the polypeptides of this invention. The array may be based on autoantibody detection to
the polypeptides of this invention, see Robinson et al., Nature Medicine 8(3):295-301
(2002). Examples of peptide arrays may be found in WO 02/31463, WO 02/25288, WO
01/94946, WO 01/88162, WO 01/68671, WO 01/57259, WO 00/61806, WO 00/54046,
WO 00/47774, WO 99/40434, WO 99/39210, and WO 97/42507 and U.S. Patent Nos. 
6,268,210, 5,766,960, and 5,143,854, the disclosures of which are incorporated herein by 
reference in their entireties. 

In addition, determination of the levels of the CSNA or CSP may be made in a
multiplex manner using techniques described in WO 02/29109, WO 02/24959, WO
01/83502, WO 01/73113, WO 01/59432, WO 01/57269, and WO 99/67641, the
disclosures of which are incorporated herein by reference in their entireties.

The term "mutant", "mutated", or "mutation" when applied to nucleic acid
sequences means that nucleotides in a nucleic acid sequence may be inserted, deleted or
changed compared to a reference nucleic acid sequence. A single alteration may be made
at a locus (a point mutation) or multiple nucleotides may be inserted, deleted or changed at
a single locus. In addition, one or more alterations may be made at any number of loci
within a nucleic acid sequence. In a preferred embodiment of the present invention, the
nucleic acid sequence is the wild type nucleic acid sequence encoding a CSP or is a
CSNA. The nucleic acid sequence may be mutated by any method known in the art
including those mutagenesis techniques described infra.

The term "error-prone PCR" refers to a process for performing PCR under
conditions where the copying fidelity of the DNA polymerase is low, such that a high rate
of point mutations is obtained along the entire length of the PCR product. See, e.g., Leung
et al., Technique 1: 11-15 (1989) and Caldwell et al., PCR Methods Applic. 2: 28-33
(1992).
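
By way of illustration only, the consequence of low copying fidelity can be sketched in silico. The following Python sketch is not a laboratory protocol and is not taken from the cited references; the function names error_prone_copy and count_point_mutations and the per-base error_rate parameter are hypothetical, and a uniform, independent substitution probability at every position is assumed.

```python
import random

BASES = "ACGT"

def error_prone_copy(template, error_rate, rng):
    """Copy a DNA string, substituting each base with probability error_rate.

    Assumption: errors are independent and equally likely at every position.
    """
    product = []
    for base in template:
        if rng.random() < error_rate:
            # Choose a different base so every error is a true point mutation.
            product.append(rng.choice([b for b in BASES if b != base]))
        else:
            product.append(base)
    return "".join(product)

def count_point_mutations(reference, copy):
    """Count positions at which the copy differs from the reference."""
    return sum(1 for a, b in zip(reference, copy) if a != b)

if __name__ == "__main__":
    rng = random.Random(0)          # seeded only to make the run repeatable
    template = "ACGT" * 25          # hypothetical 100-nucleotide template
    copy = error_prone_copy(template, 0.02, rng)
    print(count_point_mutations(template, copy), "point mutations")
```

Raising error_rate in this sketch corresponds loosely to choosing reaction conditions that further reduce polymerase fidelity.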

The term "oligonucleotide-directed mutagenesis" refers to a process which enables
the generation of site-specific mutations in any cloned DNA segment of interest. See, e.g.,
Reidhaar-Olson et al., Science 241: 53-57 (1988).

The term "assembly PCR" refers to a process which involves the assembly of a
PCR product from a mixture of small DNA fragments. A large number of different PCR
reactions occur in parallel in the same vial, with the products of one reaction priming the
products of another reaction.

The term "sexual PCR mutagenesis" or "DNA shuffling" refers to a method of
error-prone PCR coupled with forced homologous recombination between DNA
molecules of different but highly related DNA sequence in vitro, caused by random
fragmentation of the DNA molecule based on sequence similarity, followed by fixation of
the crossover by primer extension in an error-prone PCR reaction. See, e.g., Stemmer,
Proc. Natl. Acad. Sci. U.S.A. 91: 10747-10751 (1994). DNA shuffling can be carried out
between several related genes ("Family shuffling").

The term "in vivo mutagenesis" refers to a process of generating random mutations
in any cloned DNA of interest which involves the propagation of the DNA in a strain of
bacteria such as E. coli that carries mutations in one or more of the DNA repair pathways.
These "mutator" strains have a higher random mutation rate than that of a wild-type
parent. Propagating the DNA in a mutator strain will eventually generate random
mutations within the DNA.

The term "cassette mutagenesis" refers to any process for replacing a small region 
of a double-stranded DNA molecule with a synthetic oligonucleotide "cassette" that 
differs from the native sequence. The oligonucleotide often contains completely and/or
partially randomized native sequence. 
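
For illustration, the replacement of a small region by a cassette, and the set of oligonucleotides implied by a partially randomized cassette, can be sketched as simple string operations. The function names replace_cassette and expand_randomized, the 0-based coordinates, and the use of "N" to mark a fully randomized position are assumptions made for this sketch only.

```python
from itertools import product

def replace_cassette(sequence, start, end, cassette):
    """Replace sequence[start:end] (0-based, end exclusive) with a cassette."""
    if not (0 <= start <= end <= len(sequence)):
        raise ValueError("cassette window lies outside the sequence")
    return sequence[:start] + cassette + sequence[end:]

def expand_randomized(cassette):
    """List every concrete oligonucleotide for a cassette in which the
    letter 'N' marks a fully randomized position (assumed notation)."""
    choices = [("A", "C", "G", "T") if base == "N" else (base,)
               for base in cassette]
    return ["".join(combo) for combo in product(*choices)]

if __name__ == "__main__":
    native = "AAACCCGGG"                       # hypothetical native region
    for oligo in expand_randomized("TNT"):     # one randomized position
        print(replace_cassette(native, 3, 6, oligo))
```

A cassette with k randomized positions expands to 4^k variants in this sketch, which is why the size of the randomized window is usually kept small.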

The term "recursive ensemble mutagenesis" refers to an algorithm for protein 
engineering (protein mutagenesis) developed to produce diverse populations of 
phenotypically related mutants whose members differ in amino acid sequence. This 
method uses a feedback mechanism to control successive rounds of combinatorial cassette
mutagenesis. See, e.g., Arkin et al., Proc. Natl. Acad. Sci. U.S.A. 89: 7811-7815 (1992).

The term "exponential ensemble mutagenesis" refers to a process for generating 
combinatorial libraries with a high percentage of unique and functional mutants, wherein 
small groups of residues are randomized in parallel to identify, at each altered position, 
amino acids which lead to functional proteins. See, e.g., Delegrave et al., Biotechnology
Research 11: 1548-1552 (1993); Arnold, Current Opinion in Biotechnology 4: 450-455 
(1993). 

"Operatively linked" expression control sequences refers to a linkage in which the 
expression control sequence is either contiguous with the gene of interest to control the
gene of interest, or acts in trans or at a distance to control the gene of interest.

The term "expression control sequence" as used herein refers to polynucleotide 
sequences which are necessary to affect the expression of coding sequences to which they 
are operatively linked. Expression control sequences are sequences which control the 
transcription, post-transcriptional events and translation of nucleic acid sequences.
Expression control sequences include appropriate transcription initiation, termination,
promoter and enhancer sequences; efficient RNA processing signals such as splicing and
polyadenylation signals; sequences that stabilize cytoplasmic mRNA; sequences that
enhance translation efficiency (e.g., ribosome binding sites); sequences that enhance
protein stability; and when desired, sequences that enhance protein secretion. The nature
of such control sequences differs depending upon the host organism; in prokaryotes, such
control sequences generally include promoter, ribosomal binding site, and transcription
termination sequence. The term "control sequences" is intended to include, at a minimum,
all components whose presence is essential for expression, and can also include additional
components whose presence is advantageous, for example, leader sequences and fusion
partner sequences.

The term "vector," as used herein, is intended to refer to a nucleic acid molecule
capable of transporting another nucleic acid to which it has been linked. One type of
vector is a "plasmid", which refers to a circular double-stranded DNA loop into which
additional DNA segments may be ligated. Other vectors include cosmids, bacterial
artificial chromosomes (BAC) and yeast artificial chromosomes (YAC). Another type of
vector is a viral vector, wherein additional DNA segments may be ligated into the viral
genome. Viral vectors that infect bacterial cells are referred to as bacteriophages. Certain
vectors are capable of autonomous replication in a host cell into which they are introduced
(e.g., bacterial vectors having a bacterial origin of replication). Other vectors can be
integrated into the genome of a host cell upon introduction into the host cell, and thereby
are replicated along with the host genome. Moreover, certain vectors are capable of
directing the expression of genes to which they are operatively linked. Such vectors are
referred to herein as "recombinant expression vectors" (or simply, "expression vectors").
In general, expression vectors of utility in recombinant DNA techniques are often in the
form of plasmids. In the present specification, "plasmid" and "vector" may be used
interchangeably as the plasmid is the most commonly used form of vector. However, the
invention is intended to include other forms of expression vectors that serve equivalent
functions.

The term "recombinant host cell" (or simply "host cell"), as used herein, is
intended to refer to a cell into which a recombinant expression vector has been introduced.
It should be understood that such terms are intended to refer not only to the particular
subject cell but to the progeny of such a cell. Because certain modifications may occur in
succeeding generations due to either mutation or environmental influences, such progeny
may not, in fact, be identical to the parent cell, but are still included within the scope of
the term "host cell" as used herein.

As used herein, the phrase "open reading frame" and the equivalent acronym
"ORF" refers to that portion of a transcript-derived nucleic acid that can be translated in its
entirety into a sequence of contiguous amino acids. As so defined, an ORF has length,
measured in nucleotides, exactly divisible by 3. As so defined, an ORF need not encode
the entirety of a natural protein.

As used herein, the phrase "ORF-encoded peptide" refers to the predicted or actual
translation of an ORF.
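
The ORF definition above lends itself to a simple check, sketched below in Python for illustration. The function name is_orf is hypothetical. The sketch assumes that a stop codon anywhere in the sequence disqualifies it (since a stop codon is not translated into an amino acid) and, consistent with the statement that an ORF need not encode an entire protein, it does not require a start codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of the standard genetic code

def is_orf(seq):
    """Return True if seq can be translated in its entirety:
    non-empty, length divisible by 3, only A/C/G/T, and no stop codon.

    Assumptions of this sketch: RNA input is accepted by mapping U to T,
    and a start codon is not required.
    """
    seq = seq.upper().replace("U", "T")
    if len(seq) == 0 or len(seq) % 3 != 0:
        return False
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    return all(set(codon) <= set("ACGT") and codon not in STOP_CODONS
               for codon in codons)

if __name__ == "__main__":
    for candidate in ("ATGGCC", "ATGGC", "ATGTAAGCC", "GCCGCC"):
        print(candidate, is_orf(candidate))
```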

As used herein, the phrase "degenerate variant" of a reference nucleic acid 
sequence is meant to be inclusive of all nucleic acid sequences that can be directly 
translated, using the standard genetic code, to provide an amino acid sequence identical to
that translated from the reference nucleic acid sequence. 
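
As an illustration of this definition, two nucleic acid sequences can be compared by translating each with the standard genetic code and testing the translations for identity. In the Python sketch below, the names CODON_TABLE, translate and is_degenerate_variant are hypothetical; the table is built from the conventional T, C, A, G codon ordering, "*" denotes a stop codon, and any trailing bases that do not fill a codon are ignored as a simplifying assumption.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W"
          "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR"
          "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(_BASES, repeat=3), _AMINO)}

def translate(seq):
    """Translate a DNA sequence codon by codon ('*' marks a stop codon)."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3   # ignore an incomplete trailing codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

def is_degenerate_variant(reference, candidate):
    """True if both sequences translate to the same amino acid sequence."""
    return translate(reference) == translate(candidate)

if __name__ == "__main__":
    print(translate("ATGGCT"))                        # Met-Ala
    print(is_degenerate_variant("ATGGCT", "ATGGCC"))  # GCT and GCC both encode Ala
    print(is_degenerate_variant("ATGGCT", "ATGGAT"))  # GAT encodes Asp
```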

The term "polypeptide" encompasses both naturally occurring and non-naturally 
occurring proteins and polypeptides, as well as polypeptide fragments and polypeptide 
mutants, derivatives and analogs thereof. A polypeptide may be monomeric or polymeric.
Further, a polypeptide may comprise a number of different modules within a single
polypeptide each of which has one or more distinct activities. A preferred polypeptide in
accordance with the invention comprises a CSP encoded by a nucleic acid molecule of the 
instant invention, or a fragment, mutant, analog or derivative thereof. 

The term "isolated protein" or "isolated polypeptide" is a protein or polypeptide
that by virtue of its origin or source of derivation (1) is not associated with naturally
associated components that accompany it in its native state, (2) is free of other proteins
from the same species, (3) is expressed by a cell from a different species, or (4) does not
occur in nature. Thus, a polypeptide that is chemically synthesized or synthesized in a
cellular system different from the cell from which it naturally originates will be "isolated"
from its naturally associated components. A polypeptide or protein may also be rendered
substantially free of naturally associated components by isolation, using protein 
purification techniques well known in the art. 

A protein or polypeptide is "substantially pure," "substantially homogeneous" or
"substantially purified" when at least about 60% to 75% of a sample exhibits a single
species of polypeptide. The polypeptide or protein may be monomeric or multimeric. A
substantially pure polypeptide or protein will typically comprise about 50%, 60%, 70%,
80% or 90% w/w of a protein sample, more usually about 95%, and preferably will be
over 99% pure. Protein purity or homogeneity may be determined by a number of means
well known in the art, such as polyacrylamide gel electrophoresis of a protein sample,
followed by visualizing a single polypeptide band upon staining the gel with a stain well
known in the art. For certain purposes, higher resolution may be provided by using HPLC 
or other means well known in the art for purification.

The term "fragment" when used herein with respect to polypeptides of the present
invention refers to a polypeptide that has an amino-terminal and/or carboxy-terminal
deletion compared to a full-length CSP. In a preferred embodiment, the fragment is a
contiguous sequence in which the amino acid sequence of the fragment is identical to the
corresponding positions in the naturally occurring polypeptide. Fragments typically are at
least 5, 6, 7, 8, 9 or 10 amino acids long, preferably at least 12, 14, 16 or 18 amino acids
long, more preferably at least 20 amino acids long, more preferably at least 25, 30, 35, 40
or 45 amino acids, even more preferably at least 50 or 60 amino acids long, and even
more preferably at least 70 amino acids long. 

A "derivative" when used herein with respect to polypeptides of the present
invention refers to a polypeptide which is substantially similar in primary structural
sequence to a CSP but which includes, e.g., in vivo or in vitro chemical and biochemical
modifications that are not found in the CSP. Such modifications include, for example,
acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of flavin,
covalent attachment of a heme moiety, covalent attachment of a nucleotide or nucleotide
derivative, covalent attachment of a lipid or lipid derivative, covalent attachment of
phosphatidylinositol, cross-linking, cyclization, disulfide bond formation, demethylation,
formation of covalent cross-links, formation of cystine, formation of pyroglutamate,
formylation, gamma-carboxylation, glycosylation, GPI anchor formation, hydroxylation,
iodination, methylation, myristoylation, oxidation, proteolytic processing,
phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RNA
mediated addition of amino acids to proteins such as arginylation, and ubiquitination.
Other modifications include, e.g., labeling with radionuclides, and various enzymatic
modifications, as will be readily appreciated by those skilled in the art. A variety of
methods for labeling polypeptides and of substituents or labels useful for such purposes
are well known in the art, and include radioactive isotopes such as 125I, 32P, 35S, 14C and
3H, ligands which bind to labeled antiligands (e.g., antibodies), fluorophores,
chemiluminescent agents, enzymes, and antiligands which can serve as specific binding
pair members for a labeled ligand. The choice of label depends on the sensitivity required,
ease of conjugation with the primer, stability requirements, and available instrumentation.
Methods for labeling polypeptides are well known in the art. See Ausubel (1992), supra; 
Ausubel (1999), supra.

The term "fusion protein" refers to polypeptides of the present invention coupled
to a heterologous amino acid sequence. Fusion proteins are useful because they can be
constructed to contain two or more desired functional elements from two or more different
proteins. A fusion protein comprises at least 10 contiguous amino acids from a
polypeptide of interest, more preferably at least 20 or 30 amino acids, even more
preferably at least 40, 50 or 60 amino acids, yet more preferably at least 75, 100 or 125
amino acids. Fusion proteins can be produced recombinantly by constructing a nucleic
acid sequence that encodes the polypeptide or a fragment thereof in frame with a nucleic
acid sequence encoding a different protein or peptide and then expressing the fusion
protein. Alternatively, a fusion protein can be produced chemically by crosslinking the
polypeptide or a fragment thereof to another protein. 

The term "analog" refers to both polypeptide analogs and non-peptide analogs. 
The term "polypeptide analog" as used herein refers to a polypeptide that is comprised of a 
segment of at least 25 amino acids that has substantial identity to a portion of an amino
acid sequence but which contains non-natural amino acids or non-natural inter-residue
bonds. In a preferred embodiment, the analog has the same or similar biological activity
as the native polypeptide. Typically, polypeptide analogs comprise a conservative amino
acid substitution (or insertion or deletion) with respect to the naturally occurring sequence.
Analogs typically are at least 20 amino acids long, preferably at least 50 amino acids long
or longer, and can often be as long as a full-length naturally occurring polypeptide.

The term "non-peptide analog" refers to a compound with properties that are
analogous to those of a reference polypeptide. A non-peptide compound may also be
termed a "peptide mimetic" or a "peptidomimetic." Such compounds are often developed
with the aid of computerized molecular modeling. Peptide mimetics that are structurally
similar to useful peptides may be used to produce an equivalent effect. Generally,
peptidomimetics are structurally similar to a paradigm polypeptide (i.e., a polypeptide that
has a desired biochemical property or pharmacological activity), but have one or more
peptide linkages optionally replaced by a linkage selected from the group consisting of:
-CH2NH-, -CH2S-, -CH2-CH2-, -CH=CH- (cis and trans), -COCH2-,
-CH(OH)CH2-, and -CH2SO-, by methods well known in the art. Systematic
substitution of one or more amino acids of a consensus sequence with a D-amino acid of
the same type (e.g., D-lysine in place of L-lysine) may also be used to generate more
stable peptides. In addition, constrained peptides comprising a consensus sequence or a
substantially identical consensus sequence variation may be generated by methods known
in the art (Rizo et al., Ann. Rev. Biochem. 61:387-418 (1992)). For example, one may add
internal cysteine residues capable of forming intramolecular disulfide bridges which
cyclize the peptide.

The term "mutant" or "mutein" when referring to a polypeptide of the present 
invention relates to an amino acid sequence containing substitutions, insertions or 
deletions of one or more amino acids compared to the amino acid sequence of a CSP. A 
mutein may have one or more amino acid point substitutions, in which a single amino acid
at a position has been changed to another amino acid, one or more insertions and/or
deletions, in which one or more amino acids are inserted or deleted, respectively, in the
sequence of the naturally occurring protein, and/or truncations of the amino acid sequence
at either or both the amino or carboxy termini. Further, a mutein may have the same or
different biological activity as the naturally occurring protein. For instance, a mutein may
have an increased or decreased biological activity. A mutein has at least 50% sequence
similarity to the wild type protein, preferred is 60% sequence similarity, more preferred is
70% sequence similarity. Even more preferred are muteins having 80%, 85% or 90%
sequence similarity to a CSP. In an even more preferred embodiment, a mutein exhibits
95% sequence identity, even more preferably 97%, even more preferably 98% and even
more preferably 99%. Sequence similarity may be measured by any common sequence
analysis algorithm, such as GAP or BESTFIT or other variation of Smith-Waterman
alignment. See, T. F. Smith and M. S. Waterman, J. Mol. Biol. 147:195-197 (1981) and
W.R. Pearson, Genomics 11:635-650 (1991). 
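
For illustration, once two amino acid sequences have been aligned by a program of the kind mentioned above, a percent identity can be computed and compared with the percentage tiers recited for muteins. The Python sketch below does not perform the alignment and is not GAP or BESTFIT; the names percent_identity, MUTEIN_TIERS and highest_tier_met are hypothetical, the input is assumed to be a pre-aligned pair with "-" marking gaps, and counting gap-versus-residue columns in the denominator is only one of several conventions in use. The sketch measures identity only, not similarity.

```python
# Percentage tiers recited in the mutein definition above.
MUTEIN_TIERS = (50, 60, 70, 80, 85, 90, 95, 97, 98, 99)

def percent_identity(aligned_a, aligned_b):
    """Percent of aligned columns holding identical residues.

    Assumptions: inputs are already aligned (equal length, '-' for gaps);
    columns that are gaps in both sequences are ignored; a gap opposite a
    residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared

def highest_tier_met(percent):
    """Return the highest recited tier that the value reaches, or None."""
    met = [tier for tier in MUTEIN_TIERS if percent >= tier]
    return max(met) if met else None

if __name__ == "__main__":
    pid = percent_identity("MKTAYIAK", "MKTAYLAK")  # hypothetical peptides
    print(pid, highest_tier_met(pid))
```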

Preferred amino acid substitutions are those which: (1) reduce susceptibility to
proteolysis, (2) reduce susceptibility to oxidation, (3) alter binding affinity for forming
protein complexes, (4) alter binding affinity or enzymatic activity, and (5) confer or
modify other physicochemical or functional properties of such analogs. For example,
single or multiple amino acid substitutions (preferably conservative amino acid
substitutions) may be made in the naturally occurring sequence (preferably in the portion
of the polypeptide outside the domain(s) forming intermolecular contacts). In a preferred
embodiment, the amino acid substitutions are moderately conservative substitutions or
conservative substitutions. In a more preferred embodiment, the amino acid substitutions
are conservative substitutions. A conservative amino acid substitution should not
substantially change the structural characteristics of the parent sequence (e.g., a
replacement amino acid should not tend to disrupt a helix that occurs in the parent
sequence, or disrupt other types of secondary structure that characterize the parent
sequence). Examples of art-recognized polypeptide secondary and tertiary structures are
described in Creighton (ed.), Proteins, Structures and Molecular Principles, W. H.
Freeman and Company (1984); Branden et al. (ed.), Introduction to Protein Structure,
Garland Publishing (1991); Thornton et al., Nature 354:105-106 (1991).

As used herein, the twenty conventional amino acids and their abbreviations follow
conventional usage. See Golub et al. (eds.), Immunology - A Synthesis, 2nd Ed., Sinauer
Associates (1991). Stereoisomers (e.g., D-amino acids) of the twenty conventional amino
acids, unnatural amino acids such as α,α-disubstituted amino acids, N-alkyl amino acids,
and other unconventional amino acids may also be suitable components for polypeptides
of the present invention. Examples of unconventional amino acids include:
4-hydroxyproline, γ-carboxyglutamate, ε-N,N,N-trimethyllysine, ε-N-acetyllysine,
O-phosphoserine, N-acetylserine, N-formylmethionine, 3-methylhistidine,
5-hydroxylysine, σ-N-methylarginine, and other similar amino acids and imino acids (e.g.,
4-hydroxyproline). In the polypeptide notation used herein, the left-hand direction is the
amino terminal direction and the right-hand direction is the carboxy-terminal direction, in
accordance with standard usage and convention. 

By "homology" or "homologous" when referring to a polypeptide of the present
invention it is meant polypeptides from different organisms with a similar sequence to
the encoded amino acid sequence of a CSP and a similar biological activity or function.
Although two polypeptides are said to be "homologous," this does not imply that there is
necessarily an evolutionary relationship between the polypeptides. Instead, the term
"homologous" is defined to mean that the two polypeptides have similar amino acid
sequences and similar biological activities or functions. In a preferred embodiment, a
homologous polypeptide is one that exhibits 50% sequence similarity to CSP, preferred is
60% sequence similarity, more preferred is 70% sequence similarity. Even more preferred
are homologous polypeptides that exhibit 80%, 85% or 90% sequence similarity to a CSP.
In yet a more preferred embodiment, a homologous polypeptide exhibits 95%, 97%, 98%
or 99% sequence similarity. 

When "sequence similarity" is used in reference to polypeptides, it is recognized 
that residue positions that are not identical often differ by conservative amino acid
substitutions. In a preferred embodiment, a polypeptide that has "sequence similarity"
comprises conservative or moderately conservative amino acid substitutions. A
"conservative amino acid substitution" is one in which an amino acid residue is substituted
by another amino acid residue having a side chain (R group) with similar chemical
properties (e.g., charge or hydrophobicity). In general, a conservative amino acid
substitution will not substantially change the functional properties of a protein. In cases
where two or more amino acid sequences differ from each other by conservative
substitutions, the percent sequence identity or degree of similarity may be adjusted
upwards to correct for the conservative nature of the substitution. Means for making this
adjustment are well known to those of skill in the art. See, e.g., Pearson, Methods Mol.
Biol. 24:307-31 (1994). 

For instance, the following six groups each contain amino acids that are 
conservative substitutions for one another: 
1) Serine (S), Threonine (T);
2) Aspartic Acid (D), Glutamic Acid (E);
3) Asparagine (N), Glutamine (Q);
4) Arginine (R), Lysine (K);
5) Isoleucine (I), Leucine (L), Methionine (M), Alanine (A), Valine (V); and
6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).
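
The six groups listed above can be encoded directly as sets, as in the following illustrative Python sketch. The names CONSERVATIVE_GROUPS and is_conservative are hypothetical, and treating an unchanged residue as "not a substitution" is a choice made for this sketch.

```python
# One set per group listed above, using single-letter amino acid codes.
CONSERVATIVE_GROUPS = [
    set("ST"),     # 1) Serine, Threonine
    set("DE"),     # 2) Aspartic Acid, Glutamic Acid
    set("NQ"),     # 3) Asparagine, Glutamine
    set("RK"),     # 4) Arginine, Lysine
    set("ILMAV"),  # 5) Isoleucine, Leucine, Methionine, Alanine, Valine
    set("FYW"),    # 6) Phenylalanine, Tyrosine, Tryptophan
]

def is_conservative(original, replacement):
    """True if the two residues differ and belong to the same group."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # identical residues are not a substitution
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)

if __name__ == "__main__":
    for pair in (("S", "T"), ("I", "V"), ("D", "K"), ("G", "A")):
        print(pair, is_conservative(*pair))
```

Residues that appear in none of the six groups (for example glycine, proline, cysteine and histidine) are never reported as conservative by this sketch.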

Alternatively, a conservative replacement is any change having a positive value in
the PAM250 log-likelihood matrix disclosed in Gonnet et al., Science 256: 1443-45
(1992). A "moderately conservative" replacement is any change having a nonnegative 
value in the PAM250 log-likelihood matrix. 

Sequence similarity for polypeptides, which is also referred to as sequence
identity, is typically measured using sequence analysis software. Protein analysis software
matches similar sequences using measures of similarity assigned to various substitutions,
deletions and other modifications, including conservative amino acid substitutions. For
instance, GCG contains programs such as "Gap" and "Bestfit" which can be used with
default parameters to determine sequence homology or sequence identity between closely
related polypeptides, such as homologous polypeptides from different species of
organisms or between a wild type protein and a mutein thereof. See, e.g., GCG Version
6.1. Other programs include FASTA, discussed supra.

A preferred algorithm when comparing a sequence of the invention to a database
containing a large number of sequences from different organisms is the computer program
BLAST, especially blastp or tblastn. See, e.g., Altschul et al., J. Mol. Biol. 215: 403-410
(1990); Altschul et al., Nucleic Acids Res. 25:3389-402 (1997). Preferred parameters for
blastp are:

Expectation value: 10 (default)
Filter: seg (default)
Cost to open a gap: 11 (default)
Cost to extend a gap: 1 (default)
Max. alignments: 100 (default)
Word size: 11 (default)
No. of descriptions: 100 (default)
Penalty Matrix: BLOSUM62

The length of polypeptide sequences compared for homology will generally be at
least about 16 amino acid residues, usually at least about 20 residues, more usually at least
about 24 residues, typically at least about 28 residues, and preferably more than about 35 
residues. When searching a database containing sequences from a large number of 
different organisms, it is preferable to compare amino acid sequences. 

Algorithms other than blastp for database searching using amino acid sequences
are known in the art. For instance, polypeptide sequences can be compared using FASTA,
a program in GCG Version 6.1. FASTA (e.g., FASTA2 and FASTA3) provides
alignments and percent sequence identity of the regions of the best overlap between the
query and search sequences (Pearson (1990), supra; Pearson (2000), supra). For example,
percent sequence identity between amino acid sequences can be determined using FASTA
with its default or recommended parameters (a word size of 2 and the PAM250 scoring
matrix), as provided in GCG Version 6.1. 

An "antibody" refers to an intact immunoglobulin, or to an antigen-binding portion
thereof that competes with the intact antibody for specific binding to a molecular species,
e.g., a polypeptide of the instant invention. Antigen-binding portions may be produced by
recombinant DNA techniques or by enzymatic or chemical cleavage of intact antibodies.
Antigen-binding portions include, inter alia, Fab, Fab', F(ab')2, Fv, dAb, and
complementarity determining region (CDR) fragments, single-chain antibodies (scFv),
chimeric antibodies, diabodies and polypeptides that contain at least a portion of an
immunoglobulin that is sufficient to confer specific antigen binding to the polypeptide. A
Fab fragment is a monovalent fragment consisting of the VL, VH, CL and CH1 domains;
a F(ab')2 fragment is a bivalent fragment comprising two Fab fragments linked by a
disulfide bridge at the hinge region; a Fd fragment consists of the VH and CH1 domains; a
Fv fragment consists of the VL and VH domains of a single arm of an antibody; and a dAb
fragment consists of a VH domain. See, e.g., Ward et al., Nature 341: 544-546 (1989).

By "bind specifically" and "specific binding" as used herein it is meant the ability
of the antibody to bind to a first molecular species in preference to binding to other
molecular species with which the antibody and first molecular species are admixed. An
antibody is said to "recognize" a first molecular species when it can bind specifically to
that first molecular species. 

A single-chain antibody (scFv) is an antibody in which VL and VH regions are
paired to form a monovalent molecule via a synthetic linker that enables them to be made
as a single protein chain. See, e.g., Bird et al., Science 242: 423-426 (1988); Huston et al.,
Proc. Natl. Acad. Sci. USA 85: 5879-5883 (1988). Diabodies are bivalent, bispecific
antibodies in which VH and VL domains are expressed on a single polypeptide chain, but
using a linker that is too short to allow for pairing between the two domains on the same
chain, thereby forcing the domains to pair with complementary domains of another chain
and creating two antigen binding sites. See, e.g., Holliger et al., Proc. Natl. Acad. Sci. USA
90: 6444-6448 (1993); Poljak et al., Structure 2: 1121-1123 (1994). One or more CDRs
may be incorporated into a molecule either covalently or noncovalently to make it an
immunoadhesin. An immunoadhesin may incorporate the CDR(s) as part of a larger
polypeptide chain, may covalently link the CDR(s) to another polypeptide chain, or may
incorporate the CDR(s) noncovalently. The CDRs permit the immunoadhesin to
specifically bind to a particular antigen of interest. A chimeric antibody is an antibody
that contains one or more regions from one antibody and one or more regions from one or 
more other antibodies. 

An antibody may have one or more binding sites. If there is more than one binding 
site, the binding sites may be identical to one another or may be different. For instance, a
naturally occurring immunoglobulin has two identical binding sites, a single-chain
antibody or Fab fragment has one binding site, while a "bispecific" or "bifunctional"
antibody has two different binding sites.

An "isolated antibody" is an antibody that (1) is not associated with naturally-
associated components, including other naturally-associated antibodies, that accompany it
in its native state, (2) is free of other proteins from the same species, (3) is expressed by a
cell from a different species, or (4) does not occur in nature. It is known that purified
proteins, including purified antibodies, may be stabilized with non-naturally-associated
components. The non-naturally-associated component may be a protein, such as albumin 
(e.g., BSA) or a chemical such as polyethylene glycol (PEG). 

A "neutralizing antibody" or "an inhibitory antibody" is an antibody that inhibits 
the activity of a polypeptide or blocks the binding of a polypeptide to a ligand that
normally binds to it. An "activating antibody" is an antibody that increases the activity of
a polypeptide. 

The term "epitope" includes any protein determinant capable of specific binding to 
an immunoglobulin or T-cell receptor. Epitopic determinants usually consist of 
chemically active surface groupings of molecules such as amino acids or sugar side chains
and usually have specific three-dimensional structural characteristics, as well as specific
charge characteristics. An antibody is said to specifically bind an antigen when the
dissociation constant is less than 1 µM, preferably less than 100 nM and most preferably
less than 10 nM.
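
By way of illustration only, the affinity tiers above can be expressed as a simple threshold check. The sketch below is an example under stated assumptions, not a prescribed method: the function name `binding_tier` and the tier labels are hypothetical, the dissociation constant is supplied in molar units, and the least stringent threshold is read as 1 µM, the only reading consistent with the stricter 100 nM and 10 nM tiers.

```python
def binding_tier(kd_molar):
    """Classify a dissociation constant (in mol/L) against the three tiers.

    The labels returned here are illustrative names, not terms of art.
    """
    if kd_molar <= 0:
        raise ValueError("dissociation constant must be positive")
    if kd_molar < 10e-9:       # less than 10 nM
        return "most preferred"
    if kd_molar < 100e-9:      # less than 100 nM
        return "preferred"
    if kd_molar < 1e-6:        # less than 1 uM (assumed reading)
        return "specific"
    return "not specific"


if __name__ == "__main__":
    for kd in (5e-9, 50e-9, 5e-7, 1e-5):
        print(kd, binding_tier(kd))
```

A lower dissociation constant means tighter binding, so the checks run from the strictest tier to the loosest.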

The term "patient" includes human and veterinary subjects. 

Throughout this specification and claims, the word "comprise," or variations such
as "comprises" or "comprising," will be understood to imply the inclusion of a stated
integer or group of integers but not the exclusion of any other integer or group of integers. 

The term "colon specific" refers to a nucleic acid molecule or polypeptide that is 
expressed predominantly in the colon as compared to other tissues in the body. In a
preferred embodiment, a "colon specific" nucleic acid molecule or polypeptide is detected
at a level that is 1.5-fold higher than in any other tissue in the body. In a more preferred
embodiment, the "colon specific" nucleic acid molecule or polypeptide is detected at a
level that is 2-fold higher than in any other tissue in the body, more preferably 5-fold higher,
still more preferably at least 10-fold, 15-fold, 20-fold, 25-fold, 50-fold or 100-fold higher
than in any other tissue in the body. Nucleic acid molecule levels may be measured by
nucleic acid hybridization, such as Northern blot hybridization, or quantitative PCR.
Polypeptide levels may be measured by any method known to accurately quantitate 
protein levels, such as Western blot analysis. 
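
By way of illustration only, the fold-expression criterion above can be sketched as a comparison of the colon level against the highest level seen in any other tissue. Everything named below is an assumption made for the example: the function names, the tissue labels, the made-up expression values, and the reading of each "N-fold higher" tier as "at least N-fold".

```python
# Fold thresholds recited in the text, from most to least stringent.
TIERS = (100, 50, 25, 20, 15, 10, 5, 2, 1.5)


def colon_fold_specificity(levels, target="colon"):
    """Ratio of the target tissue level to the highest level in any other tissue."""
    others = [value for tissue, value in levels.items() if tissue != target]
    if not others:
        raise ValueError("need at least one non-target tissue")
    highest_other = max(others)
    if highest_other <= 0:
        # Nothing detected elsewhere: treat as unbounded specificity.
        return float("inf") if levels[target] > 0 else 0.0
    return levels[target] / highest_other


def highest_tier_met(fold):
    """Return the most stringent recited tier that the fold value reaches, or None."""
    for tier in TIERS:
        if fold >= tier:
            return tier
    return None


if __name__ == "__main__":
    # Hypothetical normalized expression values (e.g., from quantitative PCR).
    levels = {"colon": 120.0, "liver": 10.0, "lung": 8.0}
    fold = colon_fold_specificity(levels)
    print(fold, highest_tier_met(fold))
```

Comparing against the single highest non-colon tissue, rather than an average, matches the "any other tissue" wording of the definition.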

Nucleic Acid Molecules, Regulatory Sequences, Vectors, Host Cells and Recombinant
Methods of Making Polypeptides

Nucleic Acid Molecules

One aspect of the invention provides isolated nucleic acid molecules that are
specific to the colon or to colon cells or tissue or that are derived from such nucleic acid
molecules. These isolated colon specific nucleic acids (CSNAs) may comprise cDNA,
genomic DNA, RNA, or a combination thereof, a fragment of one of these nucleic acids,
or may be a non-naturally occurring nucleic acid molecule. A CSNA may be derived from
an animal. In a preferred embodiment, the CSNA is derived from a human or other
mammal. In a more preferred embodiment, the CSNA is derived from a human or other
primate. In an even more preferred embodiment, the CSNA is derived from a human.

In a preferred embodiment, the nucleic acid molecule encodes a polypeptide that 
is specific to colon, a colon-specific polypeptide (CSP). In a more preferred embodiment, 
the nucleic acid molecule encodes a polypeptide that comprises an amino acid sequence of
SEQ ID NO: 95-248. In another highly preferred embodiment, the nucleic acid molecule
comprises a nucleic acid sequence of SEQ ID NO: 1-94. Nucleotide sequences of the
instantly-described nucleic acid molecules were determined by assembling several DNA
molecules from either public or proprietary databases. Some of the underlying DNA
sequences are the result, directly or indirectly, of at least one enzymatic polymerization
reaction (e.g., reverse transcription and/or polymerase chain reaction) using an automated
sequencer (such as the MegaBACE™ 1000, Amersham Biosciences, Sunnyvale, CA, 
USA). 

Nucleic acid molecules of the present invention may also comprise sequences that 
selectively hybridize to a nucleic acid molecule encoding a CSNA or a complement or
antisense thereof. The hybridizing nucleic acid molecule may or may not encode a
polypeptide or may or may not encode a CSP. However, in a preferred embodiment, the
hybridizing nucleic acid molecule encodes a CSP. In a more preferred embodiment, the 
invention provides a nucleic acid molecule that selectively hybridizes to a nucleic acid 
molecule or the antisense sequence of a nucleic acid molecule that encodes a polypeptide
comprising an amino acid sequence of SEQ ID NO: 95-248. In an even more preferred
embodiment, the invention provides a nucleic acid molecule that selectively hybridizes to
a nucleic acid molecule comprising the nucleic acid sequence of SEQ ID NO: 1-94 or the
antisense sequence thereof. Preferably, the nucleic acid molecule selectively hybridizes to
a nucleic acid molecule or the antisense sequence of a nucleic acid molecule encoding a
CSP under low stringency conditions. More preferably, the nucleic acid molecule 
selectively hybridizes to a nucleic acid molecule or the antisense sequence of a nucleic 
acid molecule encoding a CSP under moderate stringency conditions. Most preferably,
the nucleic acid molecule selectively hybridizes to a nucleic acid molecule or the antisense
sequence of a nucleic acid molecule encoding a CSP under high stringency conditions. In
a preferred embodiment, the nucleic acid molecule hybridizes under low, moderate or high
stringency conditions to a nucleic acid molecule or the antisense sequence of a nucleic
acid molecule encoding a polypeptide comprising an amino acid sequence of SEQ ID NO:
95-248. In a more preferred embodiment, the nucleic acid molecule hybridizes under low,
moderate or high stringency conditions to a nucleic acid molecule or the antisense 
sequence of a nucleic acid molecule comprising a nucleic acid sequence selected from 
SEQ ID NO: 1-94. 

Nucleic acid molecules of the present invention may also comprise nucleic acid
sequences that exhibit substantial sequence similarity to a nucleic acid encoding a CSP or
a complement of the encoding nucleic acid molecule. In this embodiment, it is preferred
that the nucleic acid molecule exhibit substantial sequence similarity to a nucleic acid
molecule encoding human CSP. More preferred is a nucleic acid molecule exhibiting
substantial sequence similarity to a nucleic acid molecule encoding a polypeptide having
an amino acid sequence of SEQ ID NO: 95-248. By substantial sequence similarity it is
meant a nucleic acid molecule having at least 60%, more preferably at least 70%, even
more preferably at least 80% and even more preferably at least 85% sequence identity
with a nucleic acid molecule encoding a CSP, such as a polypeptide having an amino acid
sequence of SEQ ID NO: 95-248. In a more preferred embodiment, the similar nucleic
acid molecule is one that has at least 90%, more preferably at least 95%, more preferably
at least 97%, even more preferably at least 98%, and still more preferably at least 99% 
sequence identity with a nucleic acid molecule encoding a CSP. Most preferred in this 
embodiment is a nucleic acid molecule that has at least 99.5%, 99.6%, 99.7%, 99.8% or 
99.9% sequence identity with a nucleic acid molecule encoding a CSP. 

The nucleic acid molecules of the present invention are also inclusive of those
exhibiting substantial sequence similarity to a CSNA or its complement. In this
embodiment, it is preferred that the nucleic acid molecule exhibit substantial sequence
similarity to a nucleic acid molecule having a nucleic acid sequence of SEQ ID NO: 1-94.
By substantial sequence similarity it is meant a nucleic acid molecule that has at least
60%, more preferably at least 70%, even more preferably at least 80% and even more
preferably at least 85% sequence identity with a CSNA, such as one having a nucleic acid
sequence of SEQ ID NO: 1-94. More preferred is a nucleic acid molecule that has at least
90%, more preferably at least 95%, more preferably at least 97%, even more preferably at
least 98%, and still more preferably at least 99% sequence identity with a CSNA. Most 
preferred is a nucleic acid molecule that has at least 99.5%, 99.6%, 99.7%, 99.8% or 
99.9% sequence identity with a CSNA. 

Nucleic acid molecules that exhibit substantial sequence similarity are inclusive of
sequences that exhibit sequence identity over their entire length to a CSNA or to a nucleic
acid molecule encoding a CSP, as well as sequences that are similar over only a part of their
length. In this case, the part is at least 50 nucleotides of the CSNA or the nucleic acid
molecule encoding a CSP, preferably at least 100 nucleotides, more preferably at least 150
or 200 nucleotides, even more preferably at least 250 or 300 nucleotides, still more
preferably at least 400 or 500 nucleotides.
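
By way of illustration only, the percent-identity and minimum-length criteria above can be sketched as follows. The sketch assumes the two sequences have already been aligned by some external tool and are supplied as equal-length strings with `-` marking gaps; the function names and default thresholds (60% identity over at least 50 aligned positions) are choices made for the example, and identity is computed here over all aligned columns, which is only one of several conventions in use.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of aligned columns in which both sequences carry the same base."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("empty alignment")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"  # a gap opposite a gap is not a match
    )
    return 100.0 * matches / len(aligned_a)


def is_substantially_similar(aligned_a, aligned_b, min_identity=60.0, min_length=50):
    """Apply the identity threshold over a part of at least min_length positions."""
    if len(aligned_a) < min_length:
        return False
    return percent_identity(aligned_a, aligned_b) >= min_identity


if __name__ == "__main__":
    # Hypothetical 60-nucleotide alignment with 6 mismatches.
    a = "A" * 60
    b = "A" * 54 + "C" * 6
    print(percent_identity(a, b), is_substantially_similar(a, b))
```

Raising `min_identity` to 90, 95 or 99 reproduces the more preferred tiers recited above.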

The substantially similar nucleic acid molecule may be a naturally occurring one 
that is derived from another species, especially one derived from another primate, wherein 
the similar nucleic acid molecule encodes an amino acid sequence that exhibits significant 
sequence identity to that of SEQ ID NO: 95-248 or demonstrates significant sequence
identity to the nucleotide sequence of SEQ ID NO: 1-94. The similar nucleic acid
molecule may also be a naturally occurring nucleic acid molecule from a human, when the
CSNA is a member of a gene family. The similar nucleic acid molecule may also be a
naturally occurring nucleic acid molecule derived from a non-primate, mammalian
species, including without limitation, domesticated species, e.g., dog, cat, mouse, rat,
rabbit, hamster, cow, horse and pig; and wild animals, e.g., monkey, fox, lions, tigers,
bears, giraffes, zebras, etc. The substantially similar nucleic acid molecule may also be a
naturally occurring nucleic acid molecule derived from a non-mammalian species, such as
birds or reptiles. The naturally occurring substantially similar nucleic acid molecule may
be isolated directly from humans or other species. In another embodiment, the
substantially similar nucleic acid molecule may be one that is experimentally produced by
random mutation of a nucleic acid molecule. In another embodiment, the substantially
similar nucleic acid molecule may be one that is experimentally produced by directed
mutation of a CSNA. In a preferred embodiment, the substantially similar nucleic acid
molecule is a CSNA. 

The nucleic acid molecules of the present invention are also inclusive of allelic 
variants of a CSNA or a nucleic acid encoding a CSP. For example, single nucleotide
polymorphisms (SNPs) occur frequently in eukaryotic genomes and the sequence
determined from one individual of a species may differ from other allelic forms present
within the population. More than 1.4 million SNPs have already been identified in the
human genome, International Human Genome Sequencing Consortium, Nature 409: 860-921
(2001). Variants with small deletions and insertions of more than a single nucleotide
are also found in the general population, and often do not alter the function of the protein.
In addition, amino acid substitutions occur frequently among natural allelic variants, and 
often do not substantially change protein function. 

In a preferred embodiment, the allelic variant is a variant of a gene, wherein the 
gene is transcribed into a mRNA that encodes a CSP. In a more preferred embodiment,
the gene is transcribed into a mRNA that encodes a CSP comprising an amino acid
sequence of SEQ ID NO: 95-248. In another preferred embodiment, the allelic variant is a
variant of a gene, wherein the gene is transcribed into a mRNA that is a CSNA. In a more
preferred embodiment, the gene is transcribed into a mRNA that comprises the nucleic
acid sequence of SEQ ID NO: 1-94. Also preferred is that the allelic variant be a
naturally occurring allelic variant in the species of interest, particularly human.

Nucleic acid molecules of the present invention are also inclusive of nucleic acid 
sequences comprising a part of a nucleic acid sequence of the instant invention. The part 
may or may not encode a polypeptide, and may or may not encode a polypeptide that is a 
CSP. In a preferred embodiment, the part encodes a CSP. In one embodiment, the
nucleic acid molecule comprises a part of a CSNA. In another embodiment, the nucleic
acid molecule comprises a part of a nucleic acid molecule that hybridizes or exhibits
substantial sequence similarity to a CSNA. In another embodiment, the nucleic acid
molecule comprises a part of a nucleic acid molecule that is an allelic variant of a CSNA.
In yet another embodiment, the nucleic acid molecule comprises a part of a nucleic acid
molecule that encodes a CSP. A part comprises at least 10 nucleotides, more preferably at
least 15, 17, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400 or 
500 nucleotides. The maximum size of a nucleic acid part is one nucleotide shorter than 
the sequence of the nucleic acid molecule encoding the full-length protein. 
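
By way of illustration only, the size limits on a "part" can be sketched as a length check combined with a substring test. The function name `is_valid_part`, the toy sequences, and the choice to require an exact, ungapped match to the full-length sequence are assumptions made for the example.

```python
def is_valid_part(part, full_length_seq, min_len=10):
    """True if `part` is a contiguous stretch of the full-length sequence that is
    at least `min_len` nucleotides long and at least one nucleotide shorter than
    the full-length sequence."""
    part = part.upper()
    full = full_length_seq.upper()
    if not (min_len <= len(part) <= len(full) - 1):
        return False
    return part in full


if __name__ == "__main__":
    full = "ACGTACGTACGTACGT"  # hypothetical 16-nucleotide sequence
    print(is_valid_part(full[2:14], full))  # 12-nucleotide internal part
    print(is_valid_part(full, full))        # the full length is not a part
```

Raising `min_len` to 15, 17, 18, 20 and so on applies the more preferred minimum sizes recited above.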

Nucleic acid molecules of the present invention are also inclusive of nucleic acid
sequences that encode fusion proteins, homologous proteins, polypeptide fragments, 
muteins and polypeptide analogs, as described infra. 

Nucleic acid molecules of the present invention are also inclusive of nucleic acid
sequences containing modifications of the native nucleic acid molecule. Examples of such
modifications include, but are not limited to, nonnative internucleoside bonds,
post-synthetic modifications or altered nucleotide analogues. One having ordinary skill in the
art would recognize that the type of modification that may be made will depend upon the
intended use of the nucleic acid molecule. For instance, when the nucleic acid molecule is
used as a hybridization probe, the range of such modifications will be limited to those that
permit sequence-discriminating base pairing of the resulting nucleic acid. When used to
direct expression of RNA or protein in vitro or in vivo, the range of such modifications
will be limited to those that permit the nucleic acid to function properly as a
polymerization substrate. When the isolated nucleic acid is used as a therapeutic agent,
the modifications will be limited to those that do not confer toxicity upon the isolated
nucleic acid. 

Accordingly, in one embodiment, a nucleic acid molecule may include nucleotide 
analogues that incorporate labels that are directly detectable, such as radiolabels or 
fluorophores, or nucleotide analogues that incorporate labels that can be visualized in a
subsequent reaction, such as biotin or various haptens. The labeled nucleic acid molecules
are particularly useful as hybridization probes. 

Common radiolabeled analogues include those labeled with ³³P, ³²P, and ³⁵S, such
as α-³²P-dATP, α-³²P-dCTP, α-³²P-dGTP, α-³²P-dTTP, α-³²P-3'dATP, α-³²P-ATP,
α-³²P-CTP, α-³²P-GTP, α-³²P-UTP, α-³⁵S-dATP, γ-³⁵S-GTP, γ-³³P-dATP, and the like.

Commercially available fluorescent nucleotide analogues readily incorporated into
the nucleic acids of the present invention include Cy3-dCTP, Cy3-dUTP, Cy5-dCTP,
Cy3-dUTP (Amersham Biosciences, Piscataway, New Jersey, USA), fluorescein-12-dUTP,
tetramethylrhodamine-6-dUTP, Texas Red®-5-dUTP, Cascade Blue®-7-dUTP,
BODIPY® FL-14-dUTP, BODIPY® TMR-14-dUTP, BODIPY® TR-14-dUTP,
Rhodamine Green™-5-dUTP, Oregon Green® 488-5-dUTP, Texas Red®-12-dUTP,
BODIPY® 630/650-14-dUTP, BODIPY® 650/665-14-dUTP, Alexa Fluor® 488-5-dUTP,
Alexa Fluor® 532-5-dUTP, Alexa Fluor® 568-5-dUTP, Alexa Fluor® 594-5-dUTP,
Alexa Fluor® 546-14-dUTP, fluorescein-12-UTP, tetramethylrhodamine-6-UTP, Texas
Red®-5-UTP, Cascade Blue®-7-UTP, BODIPY® FL-14-UTP, BODIPY® TMR-14-UTP,
BODIPY® TR-14-UTP, Rhodamine Green™-5-UTP, Alexa Fluor® 488-5-UTP, Alexa
Fluor® 546-14-UTP (Molecular Probes, Inc. Eugene, OR, USA). One may also custom
synthesize nucleotides having other fluorophores. See Henegariu et al., Nature
Biotechnol. 18: 345-348 (2000).

Haptens that are commonly conjugated to nucleotides for subsequent labeling
include biotin (biotin-11-dUTP, Molecular Probes, Inc., Eugene, OR, USA;
biotin-21-UTP, biotin-21-dUTP, Clontech Laboratories, Inc., Palo Alto, CA, USA),
digoxigenin (DIG-11-dUTP, alkali labile, DIG-11-UTP, Roche Diagnostics Corp.,
Indianapolis, IN, USA), and dinitrophenyl (dinitrophenyl-11-dUTP, Molecular Probes,
Inc., Eugene, OR, USA).

Nucleic acid molecules of the present invention can be labeled by incorporation of 
labeled nucleotide analogues into the nucleic acid. Such analogues can be incorporated by 
enzymatic polymerization, such as by nick translation, random priming, polymerase chain
reaction (PCR), terminal transferase tailing, and end-filling of overhangs, for DNA
molecules, and in vitro transcription driven, e.g., from phage promoters, such as T7, T3,
and SP6, for RNA molecules. Commercial kits are readily available for each such
labeling approach. Analogues can also be incorporated during automated solid phase
chemical synthesis. Labels can also be incorporated after nucleic acid synthesis, with the
5' phosphate and 3' hydroxyl providing convenient sites for post-synthetic covalent
attachment of detectable labels. 

Other post-synthetic approaches also permit internal labeling of nucleic acids. For 
example, fluorophores can be attached using a cisplatin reagent that reacts with the N7 of 
guanine residues (and, to a lesser extent, adenine bases) in DNA, RNA, and Peptide
Nucleic Acids (PNA) to provide a stable coordination complex between the nucleic acid
and fluorophore label (Universal Linkage System) (available from Molecular Probes, Inc.,
Eugene, OR, USA and Amersham Pharmacia Biotech, Piscataway, NJ, USA); see Alers et
al., Genes, Chromosomes & Cancer 25: 301-305 (1999); Jelsma et al., J. NIH Res. 5: 82
(1994); Van Belkum et al., BioTechniques 16: 148-153 (1994). Alternatively, nucleic
acids can be labeled using a disulfide-containing linker (FastTag™ Reagent, Vector
Laboratories, Inc., Burlingame, CA, USA) that is photo- or thermally coupled to the target
nucleic acid using aryl azide chemistry; after reduction, a free thiol is available for 
coupling to a hapten, fluorophore, sugar, affinity ligand, or other marker. 

One or more independent or interacting labels can be incorporated into the nucleic
acid molecules of the present invention. For example, both a fluorophore and a moiety
that in proximity thereto acts to quench fluorescence can be included to report specific
hybridization through release of fluorescence quenching or to report exonucleotidic
excision. See, e.g., Tyagi et al., Nature Biotechnol. 14: 303-308 (1996); Tyagi et al.,
Nature Biotechnol. 16: 49-53 (1998); Sokol et al., Proc. Natl. Acad. Sci. USA 95:
11538-11543 (1998); Kostrikis et al., Science 279: 1228-1229 (1998); Marras et al.,
Genet. Anal. 14: 151-156 (1999); Holland et al., Proc. Natl. Acad. Sci. USA 88:
7276-7280 (1991); Heid et al., Genome Res. 6(10): 986-94 (1996); Kuimelis et al.,
Nucleic Acids Symp. Ser. (37): 255-6 (1997); and U.S. Patent Nos. 5,846,726, 5,925,517,
5,925,517, 5,723,591 and 5,538,848, the disclosures of which are incorporated herein by
reference in their entireties. 

Nucleic acid molecules of the present invention may also be modified by altering 
one or more native phosphodiester internucleoside bonds to more nuclease-resistant
internucleoside bonds. See Hartmann et al. (eds.), Manual of Antisense Methodology:
Perspectives in Antisense Science, Kluwer Law International (1999); Stein et al. (eds.),
Applied Antisense Oligonucleotide Technology, Wiley-Liss (1998); Chadwick et al.
(eds.), Oligonucleotides as Therapeutic Agents - Symposium No. 209, John Wiley & Son
Ltd (1997). Such altered internucleoside bonds are often desired for techniques or for
targeted gene correction, Gamper et al., Nucl. Acids Res. 28(21): 4332-4339 (2000). For
double-stranded RNA inhibition which may utilize either natural ds RNA or ds RNA
modified in its sugar, phosphate or base, see Hannon, Nature 418(11): 244-251 (2002);
Fire et al. in WO 99/32619; Tuschl et al. in US2002/0086356; Kruetzer et al. in WO
00/44895, the disclosures of which are incorporated herein by reference in their entirety.

For circular antisense, see Kool in U.S. Patent No. 5,426,180, the disclosure of which is
incorporated herein by reference in its entirety. 

Modified oligonucleotide backbones include, without limitation,
phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters,
aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 3'-alkylene
phosphonates and chiral phosphonates, phosphinates, phosphoramidates including
3'-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates,
thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates having
normal 3'-5' linkages, 2'-5' linked analogs of these, and those having inverted polarity
wherein the adjacent pairs of nucleoside units are linked 3'-5' to 5'-3' or 2'-5' to 5'-2'.
Representative U.S. Patents that teach the preparation of the above phosphorus-containing
linkages include, but are not limited to, U.S. Patent Nos. 3,687,808; 4,469,863; 4,476,301;
5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131;
5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 5,519,126; 5,536,821;
5,541,306; 5,550,111; 5,563,253; 5,571,799; 5,587,361; and 5,625,050, the disclosures of
which are incorporated herein by reference in their entireties. In a preferred embodiment,
the modified internucleoside linkages may be used for antisense techniques.

Other modified oligonucleotide backbones do not include a phosphorus atom, but
have backbones that are formed by short chain alkyl or cycloalkyl internucleoside
linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or
more short chain heteroatomic or heterocyclic internucleoside linkages. These include
those having morpholino linkages (formed in part from the sugar portion of a nucleoside);
siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and
thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; alkene
containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino
backbones; sulfonate and sulfonamide backbones; amide backbones; and others having
mixed N, O, S and CH2 component parts. Representative U.S. patents that teach the
preparation of the above backbones include, but are not limited to, U.S. Patent Nos.
5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564;
5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086;
5,602,240; 5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312;
5,633,360; 5,677,437 and 5,677,439; the disclosures of which are incorporated herein by
reference in their entireties.

In other preferred nucleic acid molecules, both the sugar and the internucleoside
linkage are replaced with novel groups, such as peptide nucleic acids (PNA). In PNA
compounds, the phosphodiester backbone of the nucleic acid is replaced with an
amide-containing backbone, in particular by repeating N-(2-aminoethyl) glycine units linked by
amide bonds. Nucleobases are bound directly or indirectly to aza nitrogen atoms of the
amide portion of the backbone, typically by methylene carbonyl linkages. PNA can be
synthesized using a modified peptide synthesis protocol. PNA oligomers can be
synthesized by both Fmoc and tBoc methods. Representative U.S. patents that teach the
preparation of PNA compounds include, but are not limited to, U.S. Patent Nos.
5,539,082; 5,714,331; and 5,719,262, each of which is herein incorporated by reference in
its entirety. Automated PNA synthesis is readily achievable on commercial synthesizers
(see, e.g., "PNA User's Guide," Rev. 2, February 1998, Perseptive Biosystems Part No.
60138, Applied Biosystems, Inc., Foster City, CA). PNA molecules are advantageous for
a number of reasons. First, because the PNA backbone is uncharged, PNA/DNA and
PNA/RNA duplexes have a higher thermal stability than is found in DNA/DNA and
DNA/RNA duplexes. The Tm of a PNA/DNA or PNA/RNA duplex is generally 1°C
higher per base pair than the Tm of the corresponding DNA/DNA or DNA/RNA duplex
(in 100 mM NaCl). Second, PNA molecules can also form stable PNA/DNA complexes
at low ionic strength, under conditions in which DNA/DNA duplex formation does not
occur. Third, PNA also demonstrates greater specificity in binding to complementary
DNA because a PNA/DNA mismatch is more destabilizing than a DNA/DNA mismatch. A
single mismatch in a mixed PNA/DNA 15-mer lowers the Tm by 8-20°C (15°C on
average). In the corresponding DNA/DNA duplexes, a single mismatch lowers the Tm by
4-16°C (11°C on average). Because PNA probes can be significantly shorter than DNA
probes, their specificity is greater. Fourth, PNA oligomers are resistant to degradation by
enzymes, and the lifetime of these compounds is extended both in vivo and in vitro
because nucleases and proteases do not recognize the PNA polyamide backbone with
nucleobase sidechains. See, e.g., Ray et al., FASEB J. 14(9): 1041-60 (2000); Nielsen et
al., Pharmacol. Toxicol. 86(1): 3-7 (2000); Larsen et al., Biochim. Biophys. Acta. 1489(1):
159-66 (1999); Nielsen, Curr. Opin. Struct. Biol. 9(3): 353-7 (1999), and Nielsen, Curr.
Opin. Biotechnol. 10(1): 71-5 (1999).
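
By way of illustration only, the thermal-stability figures above can be combined into a rough back-of-the-envelope estimate. The sketch below is not a thermodynamic model: the function name, the starting DNA/DNA Tm of 50°C, and the use of the 15-mer average mismatch penalties for any probe are all assumptions made for the example.

```python
PNA_GAIN_PER_BP_C = 1.0           # PNA duplex Tm gain per base pair (100 mM NaCl)
PNA_MISMATCH_PENALTY_C = 15.0     # average drop per mismatch, mixed PNA/DNA 15-mer
DNA_MISMATCH_PENALTY_C = 11.0     # average drop per mismatch, DNA/DNA duplex


def estimate_pna_duplex_tm(dna_duplex_tm_c, length_bp, mismatches=0):
    """Rough Tm (in degrees C) of a PNA/DNA duplex, starting from the Tm of the
    perfectly matched DNA/DNA duplex of the same sequence."""
    if length_bp <= 0 or mismatches < 0:
        raise ValueError("length must be positive and mismatches non-negative")
    matched = dna_duplex_tm_c + PNA_GAIN_PER_BP_C * length_bp
    return matched - PNA_MISMATCH_PENALTY_C * mismatches


if __name__ == "__main__":
    dna_tm = 50.0  # hypothetical Tm of a perfectly matched DNA/DNA 15-mer
    print(estimate_pna_duplex_tm(dna_tm, 15))                 # matched PNA/DNA
    print(estimate_pna_duplex_tm(dna_tm, 15, mismatches=1))   # one mismatch
    print(dna_tm - DNA_MISMATCH_PENALTY_C)                    # DNA/DNA, one mismatch
```

Under these assumptions, a single mismatch costs the PNA probe about 4°C more than it costs the DNA probe, which is the basis of the specificity advantage described above.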

Nucleic acid molecules may be modified compared to their native structure
throughout the length of the nucleic acid molecule, or the modifications can be localized
to discrete portions thereof. As an example of the latter, chimeric nucleic acids can be
synthesized that have discrete DNA and RNA domains and that can be used for targeted
gene repair and modified PCR reactions, as further described in Misra et al., Biochem. 37:
1917-1925 (1998); and Finn et al., Nucl. Acids Res. 24: 3357-3363 (1996), and U.S. Patent Nos.
5,760,012 and 5,731,181, the disclosures of which are incorporated herein by reference in
their entireties.

Unless otherwise specified, nucleic acid molecules of the present invention can
include any topological conformation appropriate to the desired use; the term thus
explicitly comprehends, among others, single-stranded, double-stranded, triplexed,
quadruplexed, partially double-stranded, partially-triplexed, partially-quadruplexed,
branched, hairpinned, circular, and padlocked conformations. Padlocked conformations
and their utilities are further described in Baner et al., Curr. Opin. Biotechnol. 12: 11-15
(2001); Escude et al., Proc. Natl. Acad. Sci. USA 96(19): 10603-7 (1999); and Nilsson
et al., Science 265(5181): 2085-8 (1994). Triplexed and quadruplexed conformations, and
their utilities, are reviewed in Praseuth et al., Biochim. Biophys. Acta. 1489(1): 181-206
(1999); Fox, Curr. Med. Chem. 7(1): 17-37 (2000); Kochetkova et al., Methods Mol. Biol.
130: 189-201 (2000); Chan et al., J. Mol. Med. 75(4): 267-82 (1997); Rowley et al., Mol.
Med. 5(10): 693-700 (1999); Kool, Annu Rev Biophys Biomol Struct. 25: 1-28 (1996).

SNP Polymorphisms

Commonly, sequence differences between individuals involve differences in single 
nucleotide positions. SNPs may account for 90% of human DNA polymorphism. Collins
et al., 8 Genome Res. 1229-31 (1998). SNPs include single base pair positions in genomic
DNA at which different sequence alternatives (alleles) exist in a population. In addition,
the least frequent allele generally must occur at a frequency of 1% or greater. DNA
sequence variants with a reasonably high population frequency are observed
approximately every 1,000 nucleotides across the genome, with estimates as high as 1 SNP
per 350 base pairs. Wang et al., 280 Science 1077-82 (1998); Harding et al., 60 Am. J.
Human Genet. 772-89 (1997); Taillon-Miller et al., 8 Genome Res. 748-54 (1998); Cargill
et al., 22 Nat. Genet. 231-38 (1999); and Semple et al., 16 Bioinform. Disc. Note 735-38
(2000). The frequency of SNPs varies with the type and location of the change. In base
substitutions, two-thirds of the substitutions involve the C-T and G-A type. This variation
in frequency can be related to 5-methylcytosine deamination reactions that occur
frequently, particularly at CpG dinucleotides. Regarding location, SNPs occur at a much
higher frequency in non-coding regions than in coding regions. Information on over one
million variable sequences is already publicly available via the Internet and more such 
markers are available from commercial providers of genetic information. Kwok and Gu, 5 
Med. Today 538-53 (1999). 
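
By way of illustration only, the 1% frequency criterion for the least frequent allele can be sketched as a minor-allele-frequency check over the alleles observed at one position. The function names and the made-up allele counts are assumptions for the example; real studies would also account for sample size and genotyping error.

```python
from collections import Counter


def minor_allele_frequency(alleles):
    """Frequency of the least frequent allele observed at a single position.

    Returns 0.0 when only one allele is observed (a monomorphic position).
    """
    counts = Counter(a.upper() for a in alleles)
    if len(counts) < 2:
        return 0.0
    return min(counts.values()) / sum(counts.values())


def meets_frequency_criterion(alleles, threshold=0.01):
    """True if the least frequent allele occurs at `threshold` (1%) or greater."""
    return minor_allele_frequency(alleles) >= threshold


if __name__ == "__main__":
    # Hypothetical sample of 200 chromosomes: 197 carry C and 3 carry T.
    observed = ["C"] * 197 + ["T"] * 3
    print(minor_allele_frequency(observed), meets_frequency_criterion(observed))
```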

Several definitions of SNPs exist. See, e.g., Brooks, 235 Gene 177-86 (1999). As
used herein, the term "single nucleotide polymorphism" or "SNP" includes all single base
variants, thus including nucleotide insertions and deletions in addition to single nucleotide
substitutions. There are two types of nucleotide substitutions. A transition is the
replacement of one purine by another purine or one pyrimidine by another pyrimidine. A
transversion is the replacement of a purine for a pyrimidine, or vice versa.
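
By way of illustration only, the distinction between the two types of substitution can be sketched as a lookup of each base's chemical class. The function name `classify_substitution` is hypothetical, and the sketch is limited to the four DNA bases.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(ref_base, alt_base):
    """Return "transition" for purine<->purine or pyrimidine<->pyrimidine changes
    and "transversion" for purine<->pyrimidine changes."""
    ref, alt = ref_base.upper(), alt_base.upper()
    valid = PURINES | PYRIMIDINES
    if ref not in valid or alt not in valid:
        raise ValueError("bases must be one of A, C, G, T")
    if ref == alt:
        raise ValueError("identical bases are not a substitution")
    same_class = (ref in PURINES) == (alt in PURINES)
    return "transition" if same_class else "transversion"


if __name__ == "__main__":
    for ref, alt in (("C", "T"), ("G", "A"), ("A", "C"), ("G", "T")):
        print(ref, alt, classify_substitution(ref, alt))
```

Of the twelve possible single-base changes, four are transitions and eight are transversions.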

Numerous methods exist for detecting SNPs within a nucleotide sequence. A
review of many of these methods can be found in Landegren et al., 8 Genome Res. 769-76
(1998). For example, a SNP in a genomic sample can be detected by preparing a Reduced
Complexity Genome (RCG) from the genomic sample, then analyzing the RCG for the
presence or absence of a SNP. See, e.g., WO 00/18960 which is herein incorporated by
reference in its entirety. Multiple SNPs in a population of target polynucleotides can be
detected in parallel using, for example, the methods of WO 00/50869 which is herein
incorporated by reference in its entirety. Other SNP detection methods include the
methods of U.S. Pat. Nos. 6,297,018 and 6,322,980 which are herein incorporated by
reference in their entirety. Furthermore, SNPs can be detected by restriction fragment
length polymorphism (RFLP) analysis. See, e.g., U.S. Pat. Nos. 5,324,631; 5,645,995
which are herein incorporated by reference in their entirety. RFLP analysis of SNPs,
however, is limited to cases where the SNP either creates or destroys a restriction enzyme
cleavage site. SNPs can also be detected by direct sequencing of the nucleotide sequence
of interest. In addition, numerous assays have been developed to detect SNPs based on
hybridization and on mismatch distinction by polymerases and ligases. Several web sites
provide information about SNPs including Ensembl on the World Wide Web at
ensemble.org, Sanger Institute on the World Wide Web at sanger.ac.uk/genetics/exon/,
National Center for Biotechnology Information (NCBI) on the World Wide Web at
ncbi.nlm.nih.gov/SNP/, The SNP Consortium Ltd. on the World Wide Web at
snp.cshl.org. The chromosomal locations for the compositions disclosed herein are
provided below. In addition, one of ordinary skill in the art could use a BLAST against
the genome or any of the databases cited above to find the chromosomal location.
Another preferred method to find the genomic coordinates and associated SNPs would
be to use the BLAT tool (genome.ucsc.edu; Kent et al. 2001, The Human Genome
Browser at UCSC, Genome Research 996-1006, or Kent 2002, BLAT - The BLAST-Like
Alignment Tool, Genome Research, 1-9). All web sites above were accessed December 3,
2003.
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RNA interference refers to the process of sequence-specific post transcriptional 
gene silencing in animals mediated by short interfering RNAs (siRNA). Fire et ai, 1998, 
Nature, 391, 806. The corresponding process in plants is commonly referred to as post 
transcriptional gene silencing or RNA silencing and is also referred to as quelling in fungi. 

5 The process of post transcriptional gene silencing is thought to be an evolutionarily 
conserved cellular defense mechanism used to prevent the expression of foreign genes 
which is commonly shared by diverse flora and phyla. Fire et al , 1999, Trends Genet. , 
15, 358. Such protection from foreign gene expression may have evolved in response to 
the production of double-stranded RNAs (dsRNA) derived from viral infection or the 

10 random integration of transposon elements into a host genome via a cellular response that 
specifically destroys homologous single-stranded RNA or viral genomic RNA. The 
presence of dsRNA in cells triggers the RNAi response though a mechanism that has yet 
to be fully characterized. This mechanism appears to be different from the interferon 
response that results from dsRNA mediated activation of protein kinase PKR and 2\5- 

15 oligoadenylate synthetase resulting in non-specific cleavage of mRNA by ribonuclease L. 

The presence of long dsRNAs in cells stimulates the activity of a ribonuclease III
enzyme referred to as dicer. Dicer is involved in the processing of the dsRNA into short
pieces of dsRNA known as short interfering RNAs (siRNA). Bernstein et al., 2001,
Nature, 409, 363. Short interfering RNAs derived from dicer activity are typically about
21-23 nucleotides in length and comprise about 19 base pair duplexes. Dicer has also been
implicated in the excision of 21 and 22 nucleotide small temporal RNAs (stRNA) from
precursor RNA of conserved structure that are implicated in translational control.
Hutvagner et al., 2001, Science, 293, 834. The RNAi response also features an
endonuclease complex containing a siRNA, commonly referred to as an RNA-induced
silencing complex (RISC), which mediates cleavage of single stranded RNA having
sequence complementary to the antisense strand of the siRNA duplex. Cleavage of the
target RNA takes place in the middle of the region complementary to the antisense strand
of the siRNA duplex. Elbashir et al., 2001, Genes Dev., 15, 188.

Short interfering RNA mediated RNAi has been studied in a variety of systems.
Fire et al., 1998, Nature, 391, 806, were the first to observe RNAi in C. elegans. Wianny
and Goetz, 1999, Nature Cell Biol., 2, 70, describe RNAi mediated by dsRNA in mouse
embryos. Hammond et al., 2000, Nature, 404, 293, describe RNAi in Drosophila cells
transfected with dsRNA. Elbashir et al., 2001, Nature, 411, 494, describe RNAi induced
by introduction of duplexes of synthetic 21-nucleotide RNAs in cultured mammalian cells
including human embryonic kidney and HeLa cells. Recent work in Drosophila embryonic
lysates (Elbashir et al., 2001, EMBO J., 20, 6877) has revealed certain requirements for
siRNA length, structure, chemical composition, and sequence that are essential to mediate
efficient RNAi activity. These studies have shown that 21 nucleotide siRNA duplexes are
most active when containing two-nucleotide 3'-overhangs. Furthermore, complete
substitution of one or both siRNA strands with 2'-deoxy (2'-H) or 2'-O-methyl nucleotides
abolishes RNAi activity, whereas substitution of the 3'-terminal siRNA overhang
nucleotides with deoxy nucleotides (2'-H) was shown to be tolerated. Single mismatch
sequences in the center of the siRNA duplex were also shown to abolish RNAi activity. In
addition, these studies also indicate that the position of the cleavage site in the target RNA
is defined by the 5'-end of the siRNA guide sequence rather than the 3'-end. Elbashir et
al., 2001, EMBO J., 20, 6877. Other studies have indicated that a 5'-phosphate on the
target-complementary strand of a siRNA duplex is required for siRNA activity and that
ATP is utilized to maintain the 5'-phosphate moiety on the siRNA. Nykanen et al., 2001,
Cell, 107, 309.
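
The duplex geometry described above (21-nucleotide strands forming a 19 base pair duplex
with two-nucleotide 3'-overhangs) can be sketched as follows. This is an illustrative
sketch only, under the assumption that both strands are derived directly from a
23-nucleotide window of the target mRNA. The function names and the example target
sequence are hypothetical and are not taken from the cited studies.

```python
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence written 5' to 3'."""
    return rna.translate(RNA_COMPLEMENT)[::-1]


def design_sirna(mrna: str, start: int) -> tuple[str, str]:
    """Derive a 21-nt sense/antisense pair from a 23-nt window of the target.

    The antisense (guide) strand is complementary to window positions 1-21.
    The sense strand is window positions 3-23. The strands pair over
    positions 3-21 (19 base pairs), leaving a 2-nt 3'-overhang on each.
    """
    window = mrna[start:start + 23].upper().replace("T", "U")
    if len(window) != 23 or set(window) - set("ACGU"):
        raise ValueError("need 23 unambiguous nucleotides starting at `start`")
    sense = window[2:23]
    antisense = reverse_complement(window[0:21])
    return sense, antisense


# Hypothetical 23-nt target site, for illustration only.
target = "AAGCUGACCCUGAAGUUCAUCUU"
sense, antisense = design_sirna(target, 0)
```

Both returned strands are written 5' to 3'. The first 19 nucleotides of each strand are
mutually complementary, and the last two nucleotides of each strand form the 3'-overhang.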

Studies have shown that replacing the 3'-overhanging segments of a 21-mer siRNA
duplex having 2 nucleotide 3'-overhangs with deoxyribonucleotides does not have an
adverse effect on RNAi activity. Replacing up to 4 nucleotides on each end of the siRNA
with deoxyribonucleotides has been reported to be well tolerated, whereas complete
substitution with deoxyribonucleotides results in no RNAi activity. Elbashir et al., 2001,
EMBO J., 20, 6877. In addition, Elbashir et al., supra, also report that substitution of
siRNA with 2'-O-methyl nucleotides completely abolishes RNAi activity. Li et al., WO
00/44914, and Beach et al., WO 01/68836, both suggest that siRNA "may include
modifications to either the phosphate-sugar back bone or the nucleoside to include at least
one of a nitrogen or sulfur heteroatom"; however, neither application teaches to what extent
these modifications are tolerated in siRNA molecules, nor does either provide any examples
of such modified siRNA. Kreutzer and Limmer, Canadian Patent Application No. 2,359,180,
also describe certain chemical modifications for use in dsRNA constructs in order to
counteract activation of double stranded RNA-dependent protein kinase PKR, specifically
2'-amino or 2'-O-methyl nucleotides, and nucleotides containing a 2'-O or 4'-C methylene
bridge. However, Kreutzer and Limmer similarly fail to show to what extent these
modifications are tolerated in siRNA molecules, nor do they provide any examples of such
modified siRNA.

Parrish et al., 2000, Molecular Cell, 6, 1077-1087, tested certain chemical
modifications targeting the unc-22 gene in C. elegans using long (>25 nt) siRNA
transcripts. The authors describe the introduction of thiophosphate residues into these
siRNA transcripts by incorporating thiophosphate nucleotide analogs with T7 and T3
RNA polymerase and observed that "RNAs with two [phosphorothioate] modified bases
also had substantial decreases in effectiveness as RNAi triggers; [phosphorothioate]
modification of more than two residues greatly destabilized the RNAs in vitro and we
were not able to assay interference activities." Parrish et al. at 1081. The authors also
tested certain modifications at the 2'-position of the nucleotide sugar in the long siRNA
transcripts and observed that substituting deoxynucleotides for ribonucleotides "produced
a substantial decrease in interference activity", especially in the case of Uridine to
Thymidine and/or Cytidine to deoxy-Cytidine substitutions. Parrish et al. In addition, the
authors tested certain base modifications, including substituting 4-thiouracil,
5-bromouracil, 5-iodouracil, 3-(aminoallyl)uracil for uracil, and inosine for guanosine in
sense and antisense strands of the siRNA, and found that whereas 4-thiouracil and
5-bromouracil were both well tolerated, inosine "produced a substantial decrease in
interference activity" when incorporated in either strand. Incorporation of 5-iodouracil and
3-(aminoallyl)uracil in the antisense strand resulted in substantial decrease in RNAi
activity as well.

Beach et al., WO 01/68836, describes specific methods for attenuating gene
expression using endogenously derived dsRNA. Tuschl et al., WO 01/75164, describes a
Drosophila in vitro RNAi system and the use of specific siRNA molecules for certain
functional genomic and certain therapeutic applications; although Tuschl, 2001, Chem.
Biochem., 2, 239-245, doubts that RNAi can be used to cure genetic diseases or viral
infection due "to the danger of activating interferon response". Li et al., WO 00/44914,
describes the use of specific dsRNAs for use in attenuating the expression of certain target
genes. Zernicka-Goetz et al., WO 01/36646, describes certain methods for inhibiting the
expression of particular genes in mammalian cells using certain dsRNA molecules. Fire et
al., WO 99/32619, U.S. Patent No. 6,506,559, the contents of which are hereby
incorporated by reference in their entirety, describes particular methods for introducing
certain dsRNA molecules into cells for use in inhibiting gene expression. Plaetinck et al.,
WO 00/01846, describes certain methods for identifying specific genes responsible for
conferring a particular phenotype in a cell using specific dsRNA molecules. Mello et al.,
WO 01/29058, describes the identification of specific genes involved in dsRNA mediated
RNAi. Deschamps Depaillette et al., International PCT Publication No. WO 99/07409,
describes specific compositions consisting of particular dsRNA molecules combined with
certain anti-viral agents. Driscoll et al., International PCT Publication No. WO 01/49844,
describes specific DNA constructs for use in facilitating gene silencing in targeted
organisms. Parrish et al., 2000, Molecular Cell, 6, 1077-1087, describes specific
chemically modified siRNA constructs targeting the unc-22 gene of C. elegans. Tuschl et
al., International PCT Publication No. WO 02/44321, describes certain synthetic siRNA
constructs.

Methods for Using Nucleic Acid Molecules as Probes and Primers 
The isolated nucleic acid molecules of the present invention can be used as
hybridization probes to detect, characterize, and quantify hybridizing nucleic acids in, and
isolate hybridizing nucleic acids from, both genomic and transcript-derived nucleic acid
samples. When free in solution, such probes are typically, but not invariably, detectably
labeled; bound to a substrate, as in a microarray, such probes are typically, but not
invariably, unlabeled.

In one embodiment, the isolated nucleic acid molecules of the present invention
can be used as probes to detect and characterize gross alterations in the gene of a CSNA,
such as deletions, insertions, translocations, and duplications of the CSNA genomic locus
through fluorescence in situ hybridization (FISH) to chromosome spreads. See, e.g.,
Andreeff et al. (eds.), Introduction to Fluorescence In Situ Hybridization: Principles and
Clinical Applications, John Wiley & Sons (1999). The isolated nucleic acid molecules of
the present invention can be used as probes to assess smaller genomic alterations using,
e.g., Southern blot detection of restriction fragment length polymorphisms. The isolated
nucleic acid molecules of the present invention can be used as probes to isolate genomic
clones that include a nucleic acid molecule of the present invention, which thereafter can
be restriction mapped and sequenced to identify deletions, insertions, translocations, and
substitutions (single nucleotide polymorphisms, SNPs) at the sequence level.
Alternatively, detection techniques such as molecular beacons may be used; see Kostrikis
et al., Science 279:1228-1229 (1998).

The isolated nucleic acid molecules of the present invention can also be used as
probes to detect, characterize, and quantify CSNA in, and isolate CSNA from,
transcript-derived nucleic acid samples. In one embodiment, the isolated nucleic acid
molecules of the present invention can be used as hybridization probes to detect,
characterize by length, and quantify mRNA by Northern blot of total or poly-A+-selected
RNA samples. In another embodiment, the isolated nucleic acid molecules of the present
invention can be used as hybridization probes to detect, characterize by location, and
quantify mRNA by in situ hybridization to tissue sections. See, e.g., Schwarzacher et al.,
In Situ Hybridization, Springer-Verlag New York (2000). In another preferred embodiment,
the isolated nucleic acid molecules of the present invention can be used as hybridization
probes to measure the representation of clones in a cDNA library or to isolate hybridizing
nucleic acid molecules from cDNA libraries, permitting sequence level
characterization of mRNAs that hybridize to CSNAs, including, without limitation,
identification of deletions, insertions, substitutions, truncations, alternatively spliced forms
and single nucleotide polymorphisms. In yet another preferred embodiment, the nucleic
acid molecules of the instant invention may be used in microarrays.

All of the aforementioned probe techniques are well within the skill in the art, and
are described at greater length in standard texts such as Sambrook (2001), supra; Ausubel
(1999), supra; and Walker et al. (eds.), The Nucleic Acids Protocols Handbook, Humana
Press (2000).

In another embodiment, a nucleic acid molecule of the invention may be used as a
probe or primer to identify and/or amplify a second nucleic acid molecule that selectively
hybridizes to the nucleic acid molecule of the invention. In this embodiment, it is
preferred that the probe or primer be derived from a nucleic acid molecule encoding a
CSP. More preferably, the probe or primer is derived from a nucleic acid molecule
encoding a polypeptide having an amino acid sequence of SEQ ID NO: 95-248. Also
preferred are probes or primers derived from a CSNA. More preferred are probes or
primers derived from a nucleic acid molecule having a nucleotide sequence of SEQ ID
NO: 1-94.

In general, a probe or primer is at least 10 nucleotides in length, more preferably at
least 12, more preferably at least 14 and even more preferably at least 16 or 17 nucleotides
in length. In an even more preferred embodiment, the probe or primer is at least 18
nucleotides in length, even more preferably at least 20 nucleotides and even more
preferably at least 22 nucleotides in length. Primers and probes may also be longer.
For instance, a probe or primer may be 25 nucleotides in length, or may be 30, 40
or 50 nucleotides in length. Methods of performing nucleic acid hybridization using
oligonucleotide probes are well known in the art. See, e.g., Sambrook et al., 1989, supra,
Chapter 11 and pp. 11.31-11.32 and 11.40-11.44, which describe radiolabeling of short
probes, and pp. 11.45-11.53, which describe hybridization conditions for oligonucleotide
probes, including specific conditions for probe hybridization (pp. 11.50-11.51).
Methods of performing primer-directed amplification are also well known in the
art. Methods for performing the polymerase chain reaction (PCR) are compiled, inter alia,
in McPherson, PCR Basics: From Background to Bench, Springer Verlag (2000); Innis et
al. (eds.), PCR Applications: Protocols for Functional Genomics, Academic Press (1999);
Gelfand et al. (eds.), PCR Strategies, Academic Press (1998); Newton et al., PCR,
Springer-Verlag New York (1997); Burke (ed.), PCR: Essential Techniques, John Wiley
& Son Ltd (1996); White (ed.), PCR Cloning Protocols: From Molecular Cloning to
Genetic Engineering, Vol. 67, Humana Press (1996); and McPherson et al. (eds.), PCR 2:
A Practical Approach, Oxford University Press, Inc. (1995). Methods for performing
RT-PCR are collected, e.g., in Siebert et al. (eds.), Gene Cloning and Analysis by RT-PCR,
Eaton Publishing Company/BioTechniques Books Division (1998); and Siebert (ed.), PCR
Technique: RT-PCR, Eaton Publishing Company/BioTechniques Books (1995).

PCR and hybridization methods may be used to identify and/or isolate nucleic acid
molecules of the present invention including allelic variants, homologous nucleic acid
molecules and fragments. PCR and hybridization methods may also be used to identify,
amplify and/or isolate nucleic acid molecules of the present invention that encode
homologous proteins, analogs, fusion proteins or muteins of the invention. Nucleic acid
primers as described herein can be used to prime amplification of nucleic acid molecules
of the invention, using transcript-derived or genomic DNA as the template.

These nucleic acid primers can also be used, for example, to prime single base
extension (SBE) for SNP detection (see, e.g., U.S. Pat. No. 6,004,744, the disclosure of
which is incorporated herein by reference in its entirety).

Isothermal amplification approaches, such as rolling circle amplification, are also
now well-described. See, e.g., Schweitzer et al., Curr. Opin. Biotechnol. 12(1): 21-7
(2001); International Patent publications WO 97/19193 and WO 00/15779, and U.S.
Patent Nos. 5,854,033 and 5,714,320, the disclosures of which are incorporated herein by
reference in their entireties. Rolling circle amplification can be combined with other
techniques to facilitate SNP detection. See, e.g., Lizardi et al., Nature Genet. 19(3):
225-32 (1998).

Nucleic acid molecules of the present invention may be bound to a substrate either
covalently or noncovalently. The substrate can be porous or solid, planar or non-planar,
unitary or distributed. The bound nucleic acid molecules may be used as hybridization
probes, and may be labeled or unlabeled. In a preferred embodiment, the bound nucleic
acid molecules are unlabeled.

In one embodiment, the nucleic acid molecule of the present invention is bound to
a porous substrate, e.g., a membrane, typically comprising nitrocellulose, nylon, or
positively charged derivatized nylon. The nucleic acid molecule of the present invention
can be used to detect a hybridizing nucleic acid molecule that is present within a labeled
nucleic acid sample, e.g., a sample of transcript-derived nucleic acids. In another
embodiment, the nucleic acid molecule is bound to a solid substrate, including, without
limitation, glass, amorphous silicon, crystalline silicon or plastics. Examples of plastics
include, without limitation, polymethylacrylic, polyethylene, polypropylene, polyacrylate,
polymethylmethacrylate, polyvinylchloride, polytetrafluoroethylene, polystyrene,
polycarbonate, polyacetal, polysulfone, celluloseacetate, cellulosenitrate, nitrocellulose, or
mixtures thereof. The solid substrate may be any shape, including rectangular, disk-like
and spherical. In a preferred embodiment, the solid substrate is a microscope slide or
slide-shaped substrate.

The nucleic acid molecule of the present invention can be attached covalently to a
surface of the support substrate or applied to a derivatized surface in a chaotropic agent
that facilitates denaturation and adherence by presumed noncovalent interactions, or some
combination thereof. The nucleic acid molecule of the present invention can be bound to a
substrate to which a plurality of other nucleic acids are concurrently bound, hybridization
to each of the plurality of bound nucleic acids being separately detectable. At low density,
e.g., on a porous membrane, these substrate-bound collections are typically denominated
macroarrays; at higher density, typically on a solid support, such as glass, these
substrate-bound collections of plural nucleic acids are colloquially termed microarrays. As
used herein, the term microarray includes arrays of all densities. It is, therefore, another
aspect of the invention to provide microarrays that comprise one or more of the nucleic acid
molecules of the present invention.

In yet another embodiment, the invention is directed to single exon probes based 
on the CSNAs disclosed herein. 

Expression Vectors, Host Cells and Recombinant Methods of Producing Polypeptides

Another aspect of the present invention provides vectors that comprise one or more
of the isolated nucleic acid molecules of the present invention, and host cells in which
such vectors have been introduced.

The vectors can be used, inter alia, for propagating the nucleic acid molecules of
the present invention in host cells (cloning vectors), for shuttling the nucleic acid
molecules of the present invention between host cells derived from disparate organisms
(shuttle vectors), for inserting the nucleic acid molecules of the present invention into host
cell chromosomes (insertion vectors), for expressing sense or antisense RNA transcripts of
the nucleic acid molecules of the present invention in vitro or within a host cell, and for
expressing polypeptides encoded by the nucleic acid molecules of the present invention,
alone or as fusion proteins with heterologous polypeptides (expression vectors). Vectors
are by now well known in the art, and are described, inter alia, in Jones et al. (eds.),
Vectors: Cloning Applications: Essential Techniques (Essential Techniques Series), John
Wiley & Son Ltd. (1998); Jones et al. (eds.), Vectors: Expression Systems: Essential
Techniques (Essential Techniques Series), John Wiley & Son Ltd. (1998); Gacesa et al.,
Vectors: Essential Data, John Wiley & Sons Ltd. (1995); Cid-Arregui (eds.), Viral
Vectors: Basic Science and Gene Therapy, Eaton Publishing Co. (2000); Sambrook
(2001), supra; Ausubel (1999), supra. Furthermore, a variety of vectors are available
commercially. Use of existing vectors and modifications thereof are well within the skill
in the art. Thus, only basic features need be described here.

Nucleic acid sequences may be expressed by operatively linking them to an
expression control sequence in an appropriate expression vector and employing that
expression vector to transform an appropriate unicellular host. Expression control
sequences are sequences that control the transcription, post-transcriptional events and
translation of nucleic acid sequences. Such operative linking of a nucleic acid sequence of
this invention to an expression control sequence, of course, includes, if not already part of
the nucleic acid sequence, the provision of a translation initiation codon, ATG or GTG, in
the correct reading frame upstream of the nucleic acid sequence.
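
The provision of an in-frame translation initiation codon described above can be
illustrated with a short sketch. The function name and the example sequences are
hypothetical, and the sketch assumes the input is a coding sequence already in frame
(its length a multiple of three), so that prepending a single codon preserves the
reading frame.

```python
INITIATION_CODONS = ("ATG", "GTG")


def ensure_initiation_codon(coding_seq: str, default: str = "ATG") -> str:
    """Return the coding sequence with an in-frame initiation codon at its 5' end.

    If the sequence already begins with ATG or GTG it is returned unchanged;
    otherwise the default initiation codon is placed immediately upstream.
    """
    seq = coding_seq.upper()
    if default not in INITIATION_CODONS:
        raise ValueError("default must be ATG or GTG")
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence must be a whole number of codons")
    if seq[:3] in INITIATION_CODONS:
        return seq
    return default + seq


# Hypothetical coding fragments, for illustration only.
with_added_start = ensure_initiation_codon("GCTGAA")
already_has_start = ensure_initiation_codon("GTGGCT")
```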

A wide variety of host/expression vector combinations may be employed in
expressing the nucleic acid sequences of this invention. Useful expression vectors, for
example, may consist of segments of chromosomal, non-chromosomal and synthetic
nucleic acid sequences.

In one embodiment, prokaryotic cells may be used with an appropriate vector.
Prokaryotic host cells are often used for cloning and expression. In a preferred
embodiment, prokaryotic host cells include E. coli, Pseudomonas, Bacillus and
Streptomyces. In a preferred embodiment, bacterial host cells are used to express the
nucleic acid molecules of the instant invention. Useful expression vectors for bacterial
hosts include bacterial plasmids, such as those from E. coli, Bacillus or Streptomyces,
including pBluescript, pGEX-2T, pUC vectors, col E1, pCR1, pBR322, pMB9 and their
derivatives, wider host range plasmids, such as RP4, phage DNAs, e.g., the numerous
derivatives of phage lambda, e.g., NM989, λGT10 and λGT11, and other phages, e.g.,
M13 and filamentous single stranded phage DNA. Where E. coli is used as host,
selectable markers are, analogously, chosen for selectivity in gram negative bacteria: e.g.,
typical markers confer resistance to antibiotics, such as ampicillin, tetracycline,
chloramphenicol, kanamycin, streptomycin and zeocin; auxotrophic markers can also be
used.

In other embodiments, eukaryotic host cells, such as yeast, insect, mammalian or
plant cells, may be used. Yeast cells, typically S. cerevisiae, are useful for eukaryotic
genetic studies, due to the ease of targeting genetic changes by homologous recombination
and the ability to easily complement genetic defects using recombinantly expressed
proteins. Yeast cells are useful for identifying interacting protein components, e.g.,
through use of a two-hybrid system. In a preferred embodiment, yeast cells are useful for
protein expression. Vectors of the present invention for use in yeast will typically, but not
invariably, contain an origin of replication suitable for use in yeast and a selectable marker
that is functional in yeast. Yeast vectors include Yeast Integrating plasmids (e.g., YIp5)
and Yeast Replicating plasmids (the YRp and YEp series plasmids), Yeast Centromere
plasmids (the YCp series plasmids), Yeast Artificial Chromosomes (YACs) which are
based on yeast linear plasmids, denoted YLp, pGPD-2, 2μ plasmids and derivatives
thereof, and improved shuttle vectors such as those described in Gietz et al., Gene, 74:
527-34 (1988) (YIplac, YEplac and YCplac). Selectable markers in yeast vectors include
a variety of auxotrophic markers, the most common of which are (in Saccharomyces
cerevisiae) URA3, HIS3, LEU2, TRP1 and LYS2, which complement specific
auxotrophic mutations, such as ura3-52, his3-D1, leu2-D1, trp1-D1 and lys2-201.

Insect cells may be chosen for high efficiency protein expression. Where the host
cells are from Spodoptera frugiperda, e.g., Sf9 and Sf21 cell lines, and expresSF™ cells
(Protein Sciences Corp., Meriden, CT, USA), the vector replicative strategy is typically
based upon the baculovirus life cycle. Typically, baculovirus transfer vectors are used to
replace the wild-type AcMNPV polyhedrin gene with a heterologous gene of interest.
Sequences that flank the polyhedrin gene in the wild-type genome are positioned 5' and 3'
of the expression cassette on the transfer vectors. Following co-transfection with
AcMNPV DNA, a homologous recombination event occurs between these sequences
resulting in a recombinant virus carrying the gene of interest and the polyhedrin or p10
promoter. Selection can be based upon visual screening for lacZ fusion activity.

The host cells may also be mammalian cells, which are particularly useful for
expression of proteins intended as pharmaceutical agents, and for screening of potential
agonists and antagonists of a protein or a physiological pathway. Mammalian vectors
intended for autonomous extrachromosomal replication will typically include a viral
origin, such as the SV40 origin (for replication in cell lines expressing the large T-antigen,
such as COS1 and COS7 cells), the papillomavirus origin, or the EBV origin for long term
episomal replication (for use, e.g., in 293-EBNA cells, which constitutively express the
EBV EBNA-1 gene product and adenovirus E1A). Vectors intended for integration, and
thus replication as part of the mammalian chromosome, can, but need not, include an
origin of replication functional in mammalian cells, such as the SV40 origin. Vectors
based upon viruses, such as adenovirus, adeno-associated virus, vaccinia virus, and
various mammalian retroviruses, will typically replicate according to the viral replicative
strategy. Selectable markers for use in mammalian cells include, but are not limited to,
resistance to neomycin (G418), blasticidin, hygromycin and zeocin, and selection based
upon the purine salvage pathway using HAT medium.

Expression in mammalian cells can be achieved using a variety of plasmids,
including pSV2, pBC12BI, and p91023, as well as lytic virus vectors (e.g., vaccinia virus,
adenovirus, and baculovirus), episomal virus vectors (e.g., bovine papillomavirus), and
retroviral vectors (e.g., murine retroviruses). Useful vectors for insect cells include
baculoviral vectors and pVL 941.

Plant cells can also be used for expression, with the vector replicon typically
derived from a plant virus (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus,
TMV) and selectable markers chosen for suitability in plants.

It is known that codon usage of different host cells may be different. For example,
a plant cell and a human cell may exhibit a difference in codon preference for encoding a
particular amino acid. As a result, human mRNA may not be efficiently translated in a
plant, bacterial or insect host cell. Therefore, another embodiment of this invention is
directed to codon optimization. The codons of the nucleic acid molecules of the invention
may be modified to resemble, as much as possible, genes naturally contained within the
host cell without altering the amino acid sequence encoded by the nucleic acid molecule.
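
The codon optimization described above can be sketched as a codon-by-codon substitution
that leaves the encoded amino acid sequence unchanged. The following is illustrative only.
The function names are hypothetical, and the small table of "preferred" codons is a
made-up placeholder rather than measured codon usage data for any host; in practice a
codon usage table for the chosen host would be substituted. The standard genetic code is
assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA coding sequence ('*' marks a stop codon)."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def optimize_codons(dna: str, preferred: dict[str, str]) -> str:
    """Replace each codon with the host-preferred synonymous codon, if one is given."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence must be a whole number of codons")
    optimized = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        replacement = preferred.get(amino_acid, codon)
        if CODON_TABLE[replacement] != amino_acid:
            raise ValueError(f"{replacement} does not encode {amino_acid}")
        optimized.append(replacement)
    return "".join(optimized)


# Placeholder preferences for a hypothetical host (not real usage data).
HYPOTHETICAL_PREFERRED = {"L": "CTG", "A": "GCG", "K": "AAA", "R": "CGT"}

original = "ATGTTAGCAAAGAGATAA"
optimized = optimize_codons(original, HYPOTHETICAL_PREFERRED)
```

Comparing `translate(original)` with `translate(optimized)` confirms that only the
nucleotide sequence, and not the encoded polypeptide, has changed.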

Any of a wide variety of expression control sequences may be used in these
vectors to express the nucleic acid molecules of this invention. Such useful expression
control sequences include the expression control sequences associated with structural
genes of the foregoing expression vectors. Expression control sequences that control
transcription include, e.g., promoters, enhancers and transcription termination sites.
Expression control sequences in eukaryotic cells that control post-transcriptional events
include splice donor and acceptor sites and sequences that modify the half-life of the
transcribed RNA, e.g., sequences that direct poly(A) addition or binding sites for
RNA-binding proteins. Expression control sequences that control translation include ribosome
binding sites, sequences which direct targeted expression of the polypeptide to or within
particular cellular compartments, and sequences in the 5' and 3' untranslated regions that
modify the rate or efficiency of translation.

Examples of useful expression control sequences for a prokaryote, e.g., E. coli,
will include a promoter, often a phage promoter, such as phage lambda pL promoter, the
trc promoter, a hybrid derived from the trp and lac promoters, the bacteriophage T7
promoter (in E. coli cells engineered to express the T7 polymerase), the TAC or TRC
system, the major operator and promoter regions of phage lambda, the control regions of
fd coat protein, and the araBAD operon. Prokaryotic expression vectors may further
include transcription terminators, such as the aspA terminator, and elements that facilitate
translation, such as a consensus ribosome binding site and translation termination codon,
Schomer et al., Proc. Natl. Acad. Sci. USA 83: 8506-8510 (1986).

Expression control sequences for yeast cells, typically S. cerevisiae, will include a
yeast promoter, such as the CYC1 promoter, the GAL1 promoter, the GAL10 promoter,
ADH1 promoter, the promoters of the yeast α-mating system, or the GPD promoter, and
will typically have elements that facilitate transcription termination, such as the
transcription termination signals from the CYC1 or ADH1 gene.

Expression vectors useful for expressing proteins in mammalian cells will include
a promoter active in mammalian cells. These promoters include, but are not limited to,
those derived from mammalian viruses, such as the enhancer-promoter sequences from the
immediate early gene of the human cytomegalovirus (CMV), the enhancer-promoter
sequences from the Rous sarcoma virus long terminal repeat (RSV LTR), the
enhancer-promoter from SV40 and the early and late promoters of adenovirus. Other
expression control sequences include the promoter for 3-phosphoglycerate kinase or other
glycolytic enzymes, and the promoters of acid phosphatase. Other expression control
sequences include those from the gene comprising the CSNA of interest. Often, expression
is enhanced by incorporation of polyadenylation sites, such as the late SV40
polyadenylation site and the polyadenylation signal and transcription termination sequences
from the bovine growth hormone (BGH) gene, and ribosome binding sites. Furthermore,
vectors can include introns, such as intron II of rabbit β-globin gene and the SV40 splice
elements.

Preferred nucleic acid vectors also include a selectable or amplifiable marker gene
and means for amplifying the copy number of the gene of interest. Such marker genes are
well known in the art. Nucleic acid vectors may also comprise stabilizing sequences (e.g., 
ori- or ARS-like sequences and telomere-like sequences), or may alternatively be designed 
to favor directed or non-directed integration into the host cell genome. In a preferred 
embodiment, nucleic acid sequences of this invention are inserted in frame into an
expression vector that allows a high level expression of an RNA which encodes a protein
comprising the polypeptide encoded by the nucleic acid sequence of interest. Nucleic acid
cloning and sequencing methods are well known to those of skill in the art and are described
in an assortment of laboratory manuals, including Sambrook (1989), supra; Sambrook (2001),
supra; Ausubel (1992), supra; and Ausubel (1999), supra. Product information from
manufacturers of biological, chemical and immunological reagents also provides useful
information.
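
By way of illustration only, the requirement that a sequence be inserted "in frame" can be
checked computationally. The following is a minimal sketch, not part of the original
disclosure: the function names, the example tag sequence and the example inserts are
hypothetical, and the check assumes the standard genetic code and an upstream coding
sequence that begins at its own first codon.

```python
# Hypothetical helper names; standard genetic code stop codons are assumed.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def split_codons(dna: str) -> list[str]:
    """Split a DNA string into consecutive complete triplets, starting at position 0."""
    return [dna[i:i + 3] for i in range(0, len(dna) - 2, 3)]


def is_in_frame(upstream_orf: str, insert: str) -> bool:
    """Return True if the insert continues the reading frame of the upstream
    coding sequence and no stop codon occurs before the final codon."""
    upstream_orf, insert = upstream_orf.upper(), insert.upper()
    # Both pieces must be a whole number of codons to preserve the frame.
    if len(upstream_orf) % 3 or len(insert) % 3:
        return False
    codons = split_codons(upstream_orf + insert)
    # A stop codon is tolerated only as the last codon of the fusion.
    return not any(codon in STOP_CODONS for codon in codons[:-1])


tag = "ATGCATCATCAC"  # hypothetical start codon plus three histidine codons
print(is_in_frame(tag, "GCCAAGTGA"))   # insert of three codons ending in a stop
print(is_in_frame(tag, "GGCCAAGTGA"))  # one extra base shifts the frame
```

A check of this kind addresses only reading-frame continuity; it says nothing about
expression level, which depends on the vector and host as discussed herein.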

Expression vectors may be either constitutive or inducible. Inducible vectors 
include either naturally inducible promoters, such as the trc promoter, which is regulated 
by the lac operon, the pL promoter, which is regulated by tryptophan, and the
MMTV-LTR promoter, which is inducible by dexamethasone, or can contain synthetic 
promoters and/or additional elements that confer inducible control on adjacent promoters. 
Examples of inducible synthetic promoters are the hybrid Plac/ara-1 promoter and the 
PLtetO-1 promoter. The PLtetO-1 promoter takes advantage of the high expression levels
from the PL promoter of phage lambda, but replaces the lambda repressor sites with two
copies of operator 2 of the Tn10 tetracycline resistance operon, causing this promoter to
be tightly repressed by the Tet repressor protein and induced in response to tetracycline 
(Tc) and Tc derivatives such as anhydrotetracycline. Vectors may also be inducible
because they contain hormone response elements, such as the glucocorticoid response
element (GRE) and the estrogen response element (ERE), which can confer hormone 
inducibility where vectors are used for expression in cells having the respective hormone 
receptors. To reduce background levels of expression, elements responsive to ecdysone, 
an insect hormone, can be used instead, with coexpression of the ecdysone receptor. 

In one embodiment of the invention, expression vectors can be designed to fuse the
expressed polypeptide to small protein tags that facilitate purification and/or visualization.
Such tags include a polyhistidine tag that facilitates purification of the fusion protein by
immobilized metal affinity chromatography, for example using Ni-NTA resin (Qiagen Inc.,
Valencia, CA, USA) or TALON™ resin (cobalt immobilized affinity chromatography
medium, Clontech Labs, Palo Alto, CA, USA). The fusion protein can include a chitin-binding
tag and self-excising intein, permitting chitin-based purification with self-removal
of the fused tag (IMPACT™ system, New England Biolabs, Inc., Beverly, MA, USA).
Alternatively, the fusion protein can include a calmodulin-binding peptide tag, permitting 
purification by calmodulin affinity resin (Stratagene, La Jolla, CA, USA), or a specifically
excisable fragment of the biotin carboxylase carrier protein, permitting purification of in
vivo biotinylated protein using an avidin resin and subsequent tag removal (Promega, 
Madison, WI, USA). As another useful alternative, the polypeptides of the present 
invention can be expressed as a fusion to glutathione-S-transferase, the affinity and 
specificity of binding to glutathione permitting purification using glutathione affinity
resins, such as Glutathione-Superflow Resin (Clontech Laboratories, Palo Alto, CA,
USA), with subsequent elution with free glutathione. Other tags include, for example, the
Xpress epitope, detectable by anti-Xpress antibody (Invitrogen, Carlsbad, CA, USA), a
myc tag, detectable by anti-myc tag antibody, the V5 epitope, detectable by anti-V5
antibody (Invitrogen, Carlsbad, CA, USA), FLAG® epitope, detectable by anti-FLAG®
antibody (Stratagene, La Jolla, CA, USA), and the HA epitope, detectable by anti-HA 
antibody. 

For secretion of expressed polypeptides, vectors can include appropriate sequences 
that encode secretion signals, such as leader peptides. For example, the pSecTag2 vectors
(Invitrogen, Carlsbad, CA, USA) are 5.2 kb mammalian expression vectors that carry the
secretion signal from the V-J2-C region of the mouse Ig kappa-chain for efficient secretion
of recombinant proteins from a variety of mammalian cell lines. 

Expression vectors can also be designed to fuse proteins encoded by the
heterologous nucleic acid insert to polypeptides that are larger than purification and/or
identification tags. Useful protein fusions include those that permit display of the encoded 
protein on the surface of a phage or cell, fusions to intrinsically fluorescent proteins, such 
as those that have a green fluorescent protein (GFP)-like chromophore, fusions to the IgG 
Fc region, and fusions for use in two hybrid systems. 

Vectors for phage display fuse the encoded polypeptide to, e.g., the gene III
protein (pIII) or gene VIII protein (pVIII) for display on the surface of filamentous phage,
such as M13. See Barbas et al., Phage Display: A Laboratory Manual, Cold Spring
Harbor Laboratory Press (2001); Kay et al. (eds.), Phage Display of Peptides and Proteins:
A Laboratory Manual, Academic Press, Inc. (1996); Abelson et al. (eds.), Combinatorial
Chemistry (Methods in Enzymology, Vol. 267), Academic Press (1996). Vectors for yeast
display, e.g., the pYD1 yeast display vector (Invitrogen, Carlsbad, CA, USA), use the
a-agglutinin yeast adhesion receptor to display recombinant protein on the surface of S.
cerevisiae. Vectors for mammalian display, e.g., the pDisplay™ vector (Invitrogen,
Carlsbad, CA, USA), target recombinant proteins using an N-terminal cell surface
targeting signal and a C-terminal transmembrane anchoring domain of platelet derived
growth factor receptor. 

A wide variety of vectors now exist that fuse proteins encoded by heterologous 
nucleic acids to the chromophore of the substrate-independent, intrinsically fluorescent 
green fluorescent protein from Aequorea victoria ("GFP") and its variants. The GFP-like
chromophore can be selected from GFP-like chromophores found in naturally occurring
proteins, such as A. victoria GFP (GenBank accession number AAA27721), Renilla
reniformis GFP, FP583 (GenBank accession no. AF168419) (DsRed), FP593 (AF272711),
FP483 (AF168420), FP484 (AF168424), FP595 (AF246709), FP486 (AF168421), FP538
(AF168423), and FP506 (AF168422), and need include only so much of the native protein
as is needed to retain the chromophore's intrinsic fluorescence. Methods for determining
the minimal domain required for fluorescence are known in the art. See Li et al., J. Biol.
Chem. 272: 28545-28549 (1997). Alternatively, the GFP-like chromophore can be
selected from GFP-like chromophores modified from those found in nature. The methods
for engineering such modified GFP-like chromophores and testing them for fluorescence
activity, both alone and as part of protein fusions, are well known in the art. See Heim et
al., Curr. Biol. 6: 178-182 (1996) and Palm et al., Methods Enzymol. 302: 378-394 (1999).
A variety of such modified chromophores are now commercially available and can readily
be used in the fusion proteins of the present invention. These include EGFP ("enhanced
GFP"), EBFP ("enhanced blue fluorescent protein"), BFP2, EYFP ("enhanced yellow
fluorescent protein"), ECFP ("enhanced cyan fluorescent protein") or Citrine. EGFP (see,
e.g., Cormack et al., Gene 173: 33-38 (1996); U.S. Patent Nos. 6,090,919 and 5,804,387,
the disclosures of which are incorporated herein by reference in their entireties) is found
on a variety of vectors, both plasmid and viral, which are available commercially
(Clontech Labs, Palo Alto, CA, USA); EBFP is optimized for expression in mammalian
cells whereas BFP2, which retains the original jellyfish codons, can be expressed in
bacteria (see, e.g., Heim et al., Curr. Biol. 6: 178-182 (1996) and Cormack et al., Gene
173: 33-38 (1996)). Vectors containing these blue-shifted variants are available from
Clontech Labs (Palo Alto, CA, USA). Vectors containing EYFP, ECFP (see, e.g., Heim et
al., Curr. Biol. 6: 178-182 (1996); Miyawaki et al., Nature 388: 882-887 (1997)) and
Citrine (see, e.g., Heikal et al., Proc. Natl. Acad. Sci. USA 97: 11996-12001 (2000)) are
also available from Clontech Labs. The GFP-like chromophore can also be drawn from
other modified GFPs, including those described in U.S. Patent Nos. 6,124,128; 6,096,865;
6,090,919; 6,066,476; 6,054,321; 6,027,881; 5,968,750; 5,874,304; 5,804,387; 5,777,079;
5,741,668; and 5,625,048, the disclosures of which are incorporated herein by reference in
their entireties. See also Conn (ed.), Green Fluorescent Protein (Methods in Enzymology,
Vol. 302), Academic Press, Inc. (1999); Yang et al., J. Biol. Chem. 273: 8212-6 (1998);
Bevis et al., Nature Biotechnology 20: 83-7 (2002). The GFP-like chromophore of each
of these GFP variants can usefully be included in the fusion proteins of the present
invention.

Fusions to the IgG Fc region increase serum half-life of protein pharmaceutical 
products through interaction with the FcRn receptor (also denominated the FcRp receptor
and the Brambell receptor, FcRb), further described in International Patent Application
Nos. WO 97/43316, WO 97/34631, WO 96/32478, and WO 96/18412, the disclosures of 
which are incorporated herein by reference in their entireties. 

For long-term, high-yield recombinant production of the polypeptides of the 
present invention, stable expression is preferred. Stable expression is readily achieved by
integration into the host cell genome of vectors having selectable markers, followed by 
selection of these integrants. Vectors such as pUB6/V5-His A, B, and C (Invitrogen, 
Carlsbad, CA, USA) are designed for high-level stable expression of heterologous proteins 
in a wide range of mammalian tissue types and cell lines. pUB6/V5-His uses the 
promoter/enhancer sequence from the human ubiquitin C gene to drive expression of
recombinant proteins: expression levels in 293, CHO, and NIH3T3 cells are comparable to
levels from the CMV and human EF-1α promoters. The bsd gene permits rapid selection
of stably transfected mammalian cells with the potent antibiotic blasticidin. 

Replication-incompetent retroviral vectors, typically derived from Moloney murine
leukemia virus, also are useful for creating stable transfectants having integrated provirus.
The highly efficient transduction machinery of retroviruses, coupled with the availability
of a variety of packaging cell lines such as RetroPack™ PT 67, EcoPack2™-293,
AmphoPack-293, and GP2-293 cell lines (all available from Clontech Laboratories, Palo
Alto, CA, USA), allows a wide host range to be infected with high efficiency; varying the
multiplicity of infection readily adjusts the copy number of the integrated provirus.

Of course, not all vectors and expression control sequences will function equally 
well to express the nucleic acid molecules of this invention. Neither will all hosts function 
equally well with the same expression system. However, one of skill in the art may make 
a selection among these vectors, expression control sequences and hosts without undue 
experimentation and without departing from the scope of this invention. For example, in 
selecting a vector, the host must be considered because the vector must be replicated in it.
The vector's copy number, the ability to control that copy number, the ability to control
integration, if any, and the expression of any other proteins encoded by the vector, such as
an antibiotic or other selection marker, should also be considered. The present invention
further includes host cells comprising the vectors of the present invention, either present
episomally within the cell or integrated, in whole or in part, into the host cell chromosome.
Among other considerations, some of which are described above, a host cell strain may be
chosen for its ability to process the expressed polypeptide in the desired fashion. Such
post-translational modifications of the polypeptide include, but are not limited to,
acetylation, carboxylation, glycosylation, phosphorylation, lipidation, and acylation, and it 
is an aspect of the present invention to provide CSPs with such post-translational 
modifications. 

In selecting an expression control sequence, a variety of factors should also be
considered. These include, for example, the relative strength of the sequence, its 
controllability, and its compatibility with the nucleic acid molecules of this invention, 
particularly with regard to potential secondary structures. Unicellular hosts should be 
selected by consideration of their compatibility with the chosen vector, the toxicity of the
product coded for by the nucleic acid sequences of this invention, their secretion
characteristics, their ability to fold the polypeptide correctly, their fermentation or culture 
requirements, and the ease of purification from them of the products coded for by the 
nucleic acid molecules of this invention. 

The recombinant nucleic acid molecules and, more particularly, the expression
vectors of this invention may be used to express the polypeptides of this invention as
recombinant polypeptides in a heterologous host cell. The polypeptides of this invention 
may be full-length or less than full-length polypeptide fragments recombinantly expressed 
from the nucleic acid molecules according to this invention. Such polypeptides include 
analogs, derivatives and muteins that may or may not have biological activity. 

Vectors of the present invention will also often include elements that permit in
vitro transcription of RNA from the inserted heterologous nucleic acid. Such vectors 
typically include a phage promoter, such as that from T7, T3, or SP6, flanking the nucleic 
acid insert. Often two different such promoters flank the inserted nucleic acid, permitting 
separate in vitro production of both sense and antisense strands. 
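
As a simple illustration of the relationship between the two transcripts, the sketch below
derives the sense and antisense RNA sequences from an insert given as its coding (sense)
strand. This is a hedged example only: the function names and the example insert are
hypothetical and do not correspond to any sequence of the invention.

```python
# Hypothetical illustration; names and example sequence are assumptions.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def sense_rna(insert_dna: str) -> str:
    """RNA with the same sequence as the coding strand (T replaced by U)."""
    return insert_dna.upper().replace("T", "U")


def antisense_rna(insert_dna: str) -> str:
    """RNA complementary to the coding strand, written 5' to 3'."""
    reverse_complement = insert_dna.upper().translate(COMPLEMENT)[::-1]
    return reverse_complement.replace("T", "U")


insert = "ATGGCCAAGTGA"
print(sense_rna(insert))      # AUGGCCAAGUGA
print(antisense_rna(insert))  # UCACUUGGCCAU
```

Which flanking promoter yields which strand depends on the orientation of the insert in
the particular vector, which this sketch does not model.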

Transformation and other methods of introducing nucleic acids into a host cell
(e.g., conjugation, protoplast transformation or fusion, transfection, electroporation,
liposome delivery, membrane fusion techniques, high velocity DNA-coated pellets, and viral
infection) can be accomplished by a variety of methods which are
well known in the art (see, for instance, Ausubel, supra, and Sambrook et al., supra).

Bacterial, yeast, plant or mammalian cells are transformed or transfected with an
expression vector, such as a plasmid, a cosmid, or the like, wherein the expression vector 
comprises the nucleic acid of interest. Alternatively, the cells may be infected by a viral 
expression vector comprising the nucleic acid of interest. Depending upon the host cell, 
vector, and method of transformation used, transient or stable expression of the
polypeptide will be constitutive or inducible. One having ordinary skill in the art will be
able to decide whether to express a polypeptide transiently or stably, and whether to
express the protein constitutively or inducibly.

A wide variety of unicellular host cells are useful in expressing the DNA
sequences of this invention. These hosts may include well known eukaryotic and 
prokaryotic hosts, such as strains of, fungi, yeast, insect cells such as Spodoptera 
frugiperda (Sf9), animal cells such as CHO, as well as plant cells in tissue culture.
Representative examples of appropriate host cells include, but are not limited to, bacterial
cells, such as E. coli, Caulobacter crescentus, Streptomyces species, and Salmonella
typhimurium; yeast cells, such as Saccharomyces cerevisiae, Schizosaccharomyces pombe,
Pichia pastoris, Pichia methanolica; insect cell lines, such as those from Spodoptera
frugiperda, e.g., Sf9 and Sf21 cell lines, and expresSF™ cells (Protein Sciences Corp.,
Meriden, CT, USA), Drosophila S2 cells, and Trichoplusia ni High Five® Cells
(Invitrogen, Carlsbad, CA, USA); and mammalian cells. Typical mammalian cells include
BHK cells, BSC 1 cells, BSC 40 cells, BMT 10 cells, VERO cells, COS1 cells, COS7
cells, Chinese hamster ovary (CHO) cells, 3T3 cells, NIH 3T3 cells, 293 cells, HEPG2
cells, HeLa cells, L cells, MDCK cells, HEK293 cells, WI38 cells, murine ES cell lines
(e.g., from strains 129/SV, C57/BL6, DBA-1, 129/SVJ), K562 cells, Jurkat cells, and
BW5147 cells. Other mammalian cell lines are well known and readily available from
the American Type Culture Collection (ATCC) (Manassas, VA, USA) and the National
Institute of General Medical Sciences (NIGMS) Human Genetic Cell Repository at the
Coriell Cell Repositories (Camden, NJ, USA). Cells or cell lines derived from colon are
particularly preferred because they may provide a more native post-translational
processing. Particularly preferred are human colon cells.

Particular details of the transfection, expression and purification of recombinant 
proteins are well documented and are understood by those of skill in the art. Further 
details on the various technical aspects of each of the steps used in recombinant 
production of foreign genes in bacterial cell expression systems can be found in a number 
of texts and laboratory manuals in the art. See, e.g., Ausubel (1992), supra, Ausubel
(1999), supra, Sambrook (1989), supra, and Sambrook (2001), supra.

Methods for introducing the vectors and nucleic acid molecules of the present
invention into the host cells are well known in the art; the choice of technique will depend 
primarily upon the specific vector to be introduced and the host cell chosen. 

Nucleic acid molecules and vectors may be introduced into prokaryotes, such as E. 
coli, in a number of ways. For instance, phage lambda vectors will typically be packaged
using a packaging extract (e.g., Gigapack® packaging extract, Stratagene, La Jolla, CA, 
USA), and the packaged virus used to infect E. coli. 

Plasmid vectors will typically be introduced into chemically competent or 
electrocompetent bacterial cells. E. coli cells can be rendered chemically competent by
treatment, e.g., with CaCl2, or a solution of Mg2+, Mn2+, Ca2+, Rb+ or K+, dimethyl
sulfoxide, dithiothreitol, and hexamine cobalt (III), Hanahan, J. Mol. Biol. 166(4): 557-80
(1983), and vectors introduced by heat shock. A wide variety of chemically competent
strains are also available commercially (e.g., Epicurian Coli® XL10-Gold®
Ultracompetent Cells (Stratagene, La Jolla, CA, USA); DH5α competent cells (Clontech
Laboratories, Palo Alto, CA, USA); and TOP10 Chemically Competent E. coli Kit
(Invitrogen, Carlsbad, CA, USA)). Bacterial cells can be rendered electrocompetent to
take up exogenous DNA by electroporation by various pre-pulse treatments; vectors are 
introduced by electroporation followed by subsequent outgrowth in selected media. An 
extensive series of protocols is provided by BioRad (Richmond, CA, USA). 

Vectors can be introduced into yeast cells by spheroplasting, treatment with
lithium salts, electroporation, or protoplast fusion. Spheroplasts are prepared by the action 
of hydrolytic enzymes such as a snail-gut extract, usually denoted Glusulase or 
Zymolyase, or an enzyme from Arthrobacter luteus to remove portions of the cell wall in 
the presence of osmotic stabilizers, typically 1 M sorbitol. DNA is added to the
spheroplasts, and the mixture is co-precipitated with a solution of polyethylene glycol
(PEG) and Ca2+. Subsequently, the cells are resuspended in a solution of sorbitol, mixed
with molten agar and then layered on the surface of a selective plate containing sorbitol. 

For lithium-mediated transformation, yeast cells are treated with lithium acetate to 
permeabilize the cell wall, DNA is added and the cells are co-precipitated with PEG. The
cells are exposed to a brief heat shock, washed free of PEG and lithium acetate, and
subsequently spread on plates containing ordinary selective medium. Increased 
frequencies of transformation are obtained by using specially-prepared single-stranded
carrier DNA and certain organic solvents. Schiestl et al., Curr. Genet. 16(5-6): 339-46
(1989).

For electroporation, freshly-grown yeast cultures are typically washed, suspended 
in an osmotic protectant, such as sorbitol, mixed with DNA, and the cell suspension 
pulsed in an electroporation device. Subsequently, the cells are spread on the surface of
plates containing selective media. Becker et al., Methods Enzymol. 194: 182-187 (1991).
The efficiency of transformation by electroporation can be increased over 100-fold by
using PEG, single-stranded carrier DNA and cells that are in late log-phase of growth.
Larger constructs, such as YACs, can be introduced by protoplast fusion. 

Mammalian and insect cells can be directly infected by packaged viral vectors, or
transfected by chemical or electrical means. For chemical transfection, DNA can be
coprecipitated with CaPO4 or introduced using liposomal and nonliposomal lipid-based
agents. Commercial kits are available for CaPO4 transfection (CalPhos™ Mammalian
Transfection Kit, Clontech Laboratories, Palo Alto, CA, USA), and lipid-mediated
transfection can be practiced using commercial reagents, such as LIPOFECTAMINE™
2000, LIPOFECTAMINE™ Reagent, CELLFECTIN® Reagent, and LIPOFECTIN®
Reagent (Invitrogen, Carlsbad, CA, USA), DOTAP Liposomal Transfection Reagent,
FuGENE 6, X-tremeGENE Q2, DOSPER (Roche Molecular Biochemicals, Indianapolis,
IN, USA), Effectene™, PolyFect®, and Superfect® (Qiagen, Inc., Valencia, CA, USA).
Protocols for electroporating mammalian cells can be found in, for example, Norton et
al. (eds.), Gene Transfer Methods: Introducing DNA into Living Cells and Organisms,
BioTechniques Books, Eaton Publishing Co. (2000). Other transfection techniques
include transfection by particle bombardment and microinjection. See, e.g., Cheng et al.,
Proc. Natl. Acad. Sci. USA 90(10): 4455-9 (1993); Yang et al., Proc. Natl. Acad. Sci. USA
87(24): 9568-72 (1990).

Production of the recombinantly produced proteins of the present invention can 
optionally be followed by purification. 

Purification of recombinantly expressed proteins is now well within the skill in the 
art and thus need not be detailed here. See, e.g., Thorner et al. (eds.), Applications of
Chimeric Genes and Hybrid Proteins, Part A: Gene Expression and Protein Purification
(Methods in Enzymology, Vol. 326), Academic Press (2000); Harbin (ed.), Cloning, Gene
Expression and Protein Purification: Experimental Procedures and Process Rationale,
Oxford Univ. Press (2001); Marshak et al., Strategies for Protein Purification and
Characterization: A Laboratory Course Manual, Cold Spring Harbor Laboratory Press
(1996); and Roe (ed.), Protein Purification Applications, Oxford University Press (2001).

Briefly, however, if purification tags have been fused through use of an expression 
vector that appends such tags, purification can be effected, at least in part, by means 
appropriate to the tag, such as use of immobilized metal affinity chromatography for
polyhistidine tags. Other techniques common in the art include ammonium sulfate 
fractionation, immunoprecipitation, fast protein liquid chromatography (FPLC), high 
performance liquid chromatography (HPLC), and preparative gel electrophoresis. 

Polypeptides, including Fragments, Muteins, Homologous Proteins, Allelic Variants,
Analogs and Derivatives

Another aspect of the invention relates to polypeptides encoded by the nucleic acid 
molecules described herein. In a preferred embodiment, the polypeptide is a colon 
specific polypeptide (CSP). In an even more preferred embodiment, the polypeptide 
comprises an amino acid sequence of SEQ ID NO: 95-248 or is derived from a polypeptide
having the amino acid sequence of SEQ ID NO: 95-248. A polypeptide as defined herein
may be produced recombinantly, as discussed supra, may be isolated from a cell that
naturally expresses the protein, or may be chemically synthesized following the teachings
of the specification and using methods well known to those having ordinary skill in the art.

Polypeptides of the present invention may also comprise a part or fragment of a
CSP. In a preferred embodiment, the fragment is derived from a polypeptide having an
amino acid sequence selected from the group consisting of SEQ ID NO: 95-248. 
Polypeptides of the present invention comprising a part or fragment of an entire CSP may 
or may not be CSPs. For example, a full-length polypeptide may be colon-specific, while 
a fragment thereof may be found in other tissues as well as in colon. A polypeptide that is
not a CSP, whether it is a fragment, analog, mutein, homologous protein or derivative, is
nevertheless useful, especially for immunizing animals to prepare anti-CSP antibodies. In 
a preferred embodiment, the part or fragment is a CSP. Methods of determining whether a 
polypeptide of the present invention is a CSP are described infra. 

Polypeptides of the present invention comprising fragments of at least 6
contiguous amino acids are also useful in mapping B cell and T cell epitopes of the
reference protein. See, e.g., Geysen et al., Proc. Natl. Acad. Sci. USA 81: 3998-4002
(1984) and U.S. Patent Nos. 4,708,871 and 5,595,915, the disclosures of which are
incorporated herein by reference in their entireties. Because the fragment need not itself
be immunogenic, part of an immunodominant epitope, nor even recognized by native 
antibody, to be useful in such epitope mapping, all fragments of at least 6 amino acids of a 
polypeptide of the present invention have utility in such a study. 
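
For illustration, the set of all fragments of a given length that could be used in such an
epitope-mapping study can be enumerated with a sliding window. The sketch below is an
assumption-laden example rather than a prescribed protocol: the function name is
hypothetical and the ten-residue sequence is an arbitrary placeholder, not a sequence of
SEQ ID NO: 95-248.

```python
def contiguous_fragments(sequence: str, length: int = 6) -> list[str]:
    """Return every fragment of `length` contiguous residues, in order,
    each overlapping the previous one by length - 1 residues."""
    if length < 1:
        raise ValueError("length must be at least 1")
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


# Placeholder sequence of ten residues (hypothetical).
peptide = "MKTAYIAKQR"
for fragment in contiguous_fragments(peptide, 6):
    print(fragment)
```

A polypeptide of N residues yields N - 5 overlapping hexamers, so the placeholder above
yields five fragments; a sequence shorter than the window yields none.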

Polypeptides of the present invention comprising fragments of at least 8
contiguous amino acids, often at least 15 contiguous amino acids, are useful as
immunogens for raising antibodies that recognize polypeptides of the present invention.
See, e.g., Lerner, Nature 299: 592-596 (1982); Shinnick et al., Annu. Rev. Microbiol. 37:
425-46 (1983); Sutcliffe et al., Science 219: 660-6 (1983). As further described in the
above-cited references, virtually all 8-mers, conjugated to a carrier, such as a protein,
prove immunogenic and are capable of eliciting antibody for the conjugated peptide;
accordingly, all fragments of at least 8 amino acids of the polypeptides of the present
invention have utility as immunogens.

Polypeptides comprising fragments of at least 8, 9, 10 or 12 contiguous amino
acids are also useful as competitive inhibitors of binding of the entire polypeptide, or a
portion thereof, to antibodies (as in epitope mapping), and to natural binding partners,
such as subunits in a multimeric complex or to receptors or ligands of the subject protein;
this competitive inhibition permits identification and separation of molecules that bind
specifically to the polypeptide of interest. See U.S. Patent Nos. 5,539,084 and 5,783,674,
incorporated herein by reference in their entireties.

The polypeptide of the present invention thus preferably is at least 6 amino acids in 
length, typically at least 8, 9, 10 or 12 amino acids in length, and often at least 15 amino 
acids in length. Often, the polypeptide of the present invention is at least 20 amino acids 
in length, even 25 amino acids, 30 amino acids, 35 amino acids, or 50 amino acids or more 
in length. Of course, larger polypeptides having at least 75 amino acids, 100 amino acids,
or even 150 amino acids are also useful, and at times preferred. 

One having ordinary skill in the art can produce fragments by truncating the 
nucleic acid molecule, e.g., a CSNA, encoding the polypeptide and then expressing it 
recombinantly. Alternatively, one can produce a fragment by chemically synthesizing a 
portion of the full-length polypeptide. One may also produce a fragment by enzymatically
cleaving either a recombinant polypeptide or an isolated naturally occurring polypeptide.
Methods of producing polypeptide fragments are well known in the art. See, e.g.,
Sambrook (1989), supra; Sambrook (2001), supra; Ausubel (1992), supra; and Ausubel
(1999), supra. In one embodiment, a polypeptide comprising only a fragment, preferably
a fragment of a CSP, may be produced by chemical or enzymatic cleavage of a CSP 
polypeptide. In a preferred embodiment, a polypeptide fragment is produced by 
expressing a nucleic acid molecule of the present invention encoding a fragment, 
preferably of a CSP, in a host cell.

Polypeptides of the present invention are also inclusive of mutants, fusion proteins, 
homologous proteins and allelic variants. 

A mutant protein, or mutein, may have the same or different properties compared 
to a naturally occurring polypeptide and comprises at least one amino acid insertion,
duplication, deletion, rearrangement or substitution compared to the amino acid sequence
of a native polypeptide. Small deletions and insertions can often be found that do not alter 
the function of a protein. Muteins may or may not be colon-specific. Preferably, the 
mutein is colon-specific. More preferably the mutein is a polypeptide that comprises at 
least one amino acid insertion, duplication, deletion, rearrangement or substitution
compared to the amino acid sequence of SEQ ID NO: 95-248. Accordingly, in a preferred
embodiment, the mutein is one that exhibits at least 50% sequence identity, more
preferably at least 60% sequence identity, even more preferably at least 70%, yet more
preferably at least 80% sequence identity to a CSP comprising an amino acid sequence of
SEQ ID NO: 95-248. In a yet more preferred embodiment, the mutein exhibits at least
85%, more preferably 90%, even more preferably 95% or 96%, and yet more preferably at
least 97%, 98%, 99% or 99.5% sequence identity to a CSP comprising an amino acid 
sequence of SEQ ID NO: 95-248. 
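
The identity thresholds recited above can be illustrated with a short calculation. The
sketch below is an assumption, not the method of the specification: it takes two sequences
that have already been aligned (gaps written as "-"), counts identical residues over the
aligned columns, and reports the highest recited threshold that is met. Other definitions
of percent identity (for example, those used by particular alignment programs) may give
different values. The function names and example sequences are hypothetical.

```python
# Thresholds recited in the text, in percent.
THRESHOLDS = (50, 60, 70, 80, 85, 90, 95, 96, 97, 98, 99, 99.5)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding the same residue in both sequences.
    Columns that are gaps in both sequences are ignored."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def highest_threshold_met(identity: float):
    """Largest recited threshold not exceeding the identity, or None."""
    met = [t for t in THRESHOLDS if identity >= t]
    return max(met) if met else None


reference = "MKTAYIAKQR"  # placeholder, not a sequence of the invention
mutein = "MKTAYLAKQR"     # one substitution relative to the placeholder
identity = percent_identity(reference, mutein)
print(identity, highest_threshold_met(identity))  # 90.0 90
```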

A mutein may be produced by isolation from a naturally occurring mutant cell, 
tissue or organism. A mutein may be produced by isolation from a cell, tissue or organism
that has been experimentally mutagenized. Alternatively, a mutein may be produced by
chemical manipulation of a polypeptide, such as by altering the amino acid residue to 
another amino acid residue using synthetic or semi-synthetic chemical techniques. In a 
preferred embodiment, a mutein is produced from a host cell comprising a mutated nucleic 
acid molecule compared to the naturally occurring nucleic acid molecule. For instance,
one may produce a mutein of a polypeptide by introducing one or more mutations into a
nucleic acid molecule of the invention and then expressing it recombinantly. These 
mutations may be targeted, in which particular encoded amino acids are altered, or may be 
untargeted, in which random encoded amino acids within the polypeptide are altered. 
Muteins with random amino acid alterations can be screened for a particular biological
activity or property, particularly whether the polypeptide is colon-specific, as described 
below. Multiple random mutations can be introduced into the gene by methods well 
known in the art, e.g., by error-prone PCR, shuffling, oligonucleotide-directed
mutagenesis, assembly PCR, sexual PCR mutagenesis, in vivo mutagenesis, cassette
mutagenesis, recursive ensemble mutagenesis, exponential ensemble mutagenesis and site-
specific mutagenesis. Methods of producing muteins with targeted or random amino acid
alterations are well known in the art. See, e.g., Sambrook (1989), supra; Sambrook 
(2001), supra; Ausubel (1992), supra; and Ausubel (1999), as well as U.S. Patent No.
5,223,408, which is herein incorporated by reference in its entirety.
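
By way of illustration only, the distinction drawn above between targeted and untargeted mutations can be modeled in silico as in the following Python sketch. The sketch uses the standard genetic code; the function names, the example sequence and the use of a seeded random number generator are assumptions made for the example, and the sketch does not model any particular laboratory mutagenesis method.

```python
# Illustrative sketch only: an in silico model, not a laboratory protocol.
import random
from typing import Optional

BASES = "TCAG"
# Standard genetic code, ordered by first, second and third base over "TCAG".
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def targeted_mutation(dna: str, codon_index: int, new_codon: str) -> str:
    """Replace one chosen codon (0-based index) with a specified codon."""
    new_codon = new_codon.upper()
    if new_codon not in CODON_TABLE:
        raise ValueError("new_codon must be three bases from A, C, G, T")
    start = 3 * codon_index
    if codon_index < 0 or start + 3 > len(dna):
        raise IndexError("codon_index is outside the sequence")
    return dna[:start].upper() + new_codon + dna[start + 3:].upper()


def random_mutations(dna: str, n: int, seed: Optional[int] = None) -> str:
    """Change n distinct, randomly chosen positions to a different base."""
    rng = random.Random(seed)
    seq = list(dna.upper())
    for pos in rng.sample(range(len(seq)), n):
        seq[pos] = rng.choice([b for b in BASES if b != seq[pos]])
    return "".join(seq)


if __name__ == "__main__":
    wild_type = "ATGGCTAAA"  # hypothetical coding sequence: Met-Ala-Lys
    print(translate(wild_type))
    print(translate(targeted_mutation(wild_type, 1, "GAT")))  # Ala -> Asp
    print(translate(random_mutations(wild_type, 2, seed=0)))
```

Translated products of the randomly mutated sequences would then be screened for the property of interest, as the text describes.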

The invention also contemplates polypeptides that are homologous to a 
polypeptide of the invention. In a preferred embodiment, the polypeptide is homologous 
to a CSP. In an even more preferred embodiment, the polypeptide is homologous to a 
CSP selected from the group having an amino acid sequence of SEQ ID NO: 95-248. By
homologous polypeptide it is meant one that exhibits significant sequence identity to a
CSP, preferably a CSP having an amino acid sequence of SEQ ID NO: 95-248. By 
significant sequence identity it is meant that the homologous polypeptide exhibits at least 
50% sequence identity, more preferably at least 60% sequence identity, even more 
preferably at least 70%, yet more preferably at least 80% sequence identity to a CSP
comprising an amino acid sequence of SEQ ID NO: 95-248. More preferred are
homologous polypeptides exhibiting at least 85%, more preferably 90%, even more 
preferably 95% or 96%, and yet more preferably at least 97% or 98% sequence identity to 
a CSP comprising an amino acid sequence of SEQ ID NO: 95-248. Most preferably, the 
homologous polypeptide exhibits at least 99%, more preferably 99.5%, even more
preferably 99.6%, 99.7%, 99.8% or 99.9% sequence identity to a CSP comprising an
amino acid sequence of SEQ ID NO: 95-248. In a preferred embodiment, the amino acid
substitutions of the homologous polypeptide are conservative amino acid substitutions as 
discussed supra. 

Homologous polypeptides of the present invention also comprise polypeptides
encoded by a nucleic acid molecule that selectively hybridizes to a CSNA or an antisense
sequence thereof. In this embodiment, it is preferred that the homologous polypeptide be 
encoded by a nucleic acid molecule that hybridizes to a CSNA under low stringency, 
moderate stringency or high stringency conditions, as defined herein. More preferred is a 
homologous polypeptide encoded by a nucleic acid sequence which hybridizes to a CSNA
selected from the group consisting of SEQ ID NO: 1-94 or a homologous polypeptide 
encoded by a nucleic acid molecule that hybridizes to a nucleic acid molecule that encodes 
a CSP, preferably a CSP of SEQ ID NO: 95-248, under low stringency, moderate
stringency or high stringency conditions, as defined herein.

Homologous polypeptides of the present invention may be naturally occurring and 
derived from another species, especially one derived from another primate, such as 
chimpanzee, gorilla, rhesus macaque, or baboon, wherein the homologous polypeptide 
comprises an amino acid sequence that exhibits significant sequence identity to that of
SEQ ID NO: 95-248. The homologous polypeptide may also be a naturally occurring
polypeptide from a human, when the CSP is a member of a family of polypeptides. The 
homologous polypeptide may also be a naturally occurring polypeptide derived from a 
non-primate, mammalian species, including without limitation, domesticated species, e.g., 
dog, cat, mouse, rat, rabbit, guinea pig, hamster, cow, horse, goat or pig. The homologous
polypeptide may also be a naturally occurring polypeptide derived from a non-mammalian
species, such as birds or reptiles. The naturally occurring homologous protein may be 
isolated directly from humans or other species. Alternatively, the nucleic acid molecule 
encoding the naturally occurring homologous polypeptide may be isolated and used to 
express the homologous polypeptide recombinantly. The homologous polypeptide may
also be one that is experimentally produced by random mutation of a nucleic acid
molecule and subsequent expression of the nucleic acid molecule. Alternatively, the 
homologous polypeptide may be one that is experimentally produced by directed mutation 
of one or more codons to alter the encoded amino acid of a CSP. In a preferred
embodiment, the homologous polypeptide is a CSP.

Relatedness of proteins can also be characterized using a second functional test,
such as the ability of a first protein competitively to inhibit the binding of a second protein
to an antibody. It is, therefore, another aspect of the present invention to provide isolated
polypeptides not only identical in sequence to those described with particularity herein, but
also to provide isolated polypeptides ("cross-reactive proteins") that competitively inhibit
the binding of antibodies to all or to a portion of the isolated polypeptides of the present
invention. Such competitive inhibition can readily be determined using immunoassays
well known in the art.

As discussed above, single nucleotide polymorphisms (SNPs) occur frequently in
eukaryotic genomes, and the sequence determined from one individual of a species may 
differ from other allelic forms present within the population. Thus, polypeptides of the 
present invention are also inclusive of those encoded by an allelic variant of a nucleic acid
molecule encoding a CSP. In this embodiment, it is preferred that the polypeptide be
encoded by an allelic variant of a gene that encodes a polypeptide having the amino acid
sequence selected from the group consisting of SEQ ID NO: 95-248. More preferred is
that the polypeptide be encoded by an allelic variant of a gene that has the nucleic acid
sequence selected from the group consisting of SEQ ID NO: 1-94.

Polypeptides of the present invention are also inclusive of derivative polypeptides
encoded by a nucleic acid molecule according to the instant invention. In this
embodiment, it is preferred that the polypeptide be a CSP. Also preferred are derivative
polypeptides having an amino acid sequence selected from the group consisting of SEQ
ID NO: 95-248 and which have been acetylated, carboxylated, phosphorylated,
glycosylated, ubiquitinated or post-translationally modified in another manner. In another
preferred embodiment, the derivative has been labeled with, e.g., radioactive isotopes such
as 125I, 32P, 35S, and 3H. In another preferred embodiment, the derivative has been labeled
with fluorophores, chemiluminescent agents, enzymes, and antiligands that can serve as
specific binding pair members for a labeled ligand.

Polypeptide modifications are well known to those of skill and have been
described in great detail in the scientific literature. Several particularly common
modifications, glycosylation, lipid attachment, sulfation, gamma-carboxylation of
glutamic acid residues, hydroxylation and ADP-ribosylation, for instance, are described in
most basic texts, such as, for instance, Creighton, Protein Structure and Molecular
Properties, 2nd ed., W. H. Freeman and Company (1993). Many detailed reviews are
available on this subject, such as, for example, those provided by Wold, in Johnson (ed.),
Posttranslational Covalent Modification of Proteins, pgs. 1-12, Academic Press (1983);
Seifter et al., Meth. Enzymol. 182: 626-646 (1990) and Rattan et al., Ann. N.Y. Acad. Sci.
663: 48-62 (1992).

One may determine whether a polypeptide of the invention is likely to be post-
translationally modified by analyzing the sequence of the polypeptide to determine if there
are peptide motifs indicative of sites for post-translational modification. There are a
number of computer programs that permit prediction of post-translational modifications.
See, e.g., expasy.org (accessed November 11, 2002) on the world wide web, which
includes PSORT, for prediction of protein sorting signals and localization sites, SignalP, 
for prediction of signal peptide cleavage sites, MITOPROT and Predotar, for prediction of 
mitochondrial targeting sequences, NetOGlyc, for prediction of type O-glycosylation sites
in mammalian proteins, big-PI Predictor and DGPI, for prediction of prenylation-anchor
and cleavage sites, and NetPhos, for prediction of Ser, Thr and Tyr phosphorylation sites 
in eukaryotic proteins. Other computer programs, such as those included in GCG, also 
may be used to determine post-translational modification peptide motifs. 
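
By way of illustration only, a motif scan of the kind described above might be sketched in Python as follows. The example searches for a simplified N-linked glycosylation sequon (Asn, then any residue other than Pro, then Ser or Thr); the pattern, the function name and the example sequence are assumptions made for the example. The sketch does not reproduce any of the programs named above, and a matching motif indicates only a candidate site, not that the site is in fact modified.

```python
# Illustrative sketch only: a simplified motif scan, not any named program.
import re
from typing import List

# Asn, any residue except Pro, then Ser or Thr. The lookahead lets
# overlapping sequons be reported individually.
N_GLYC_SEQUON = re.compile(r"(?=N[^P][ST])")


def find_n_glycosylation_sequons(protein: str) -> List[int]:
    """Return 1-based positions of Asn residues that begin a candidate sequon."""
    return [m.start() + 1 for m in N_GLYC_SEQUON.finditer(protein.upper())]


if __name__ == "__main__":
    # Hypothetical sequence: Asn2 and Asn9 start sequons; Asn6 is followed by Pro.
    print(find_n_glycosylation_sequons("MNGTANPSNRT"))
```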

General examples of types of post-translational modifications include, but are not
limited to: (Z)-dehydrobutyrine; 1-chondroitin sulfate-L-aspartic acid ester; 1'-glycosyl-L-
tryptophan; 1'-phospho-L-histidine; 1-thioglycine; 2'-(S-L-cysteinyl)-L-histidine; 2'-[3-
carboxamido (trimethylammonio)propyl]-L-histidine; 2'-alpha-mannosyl-L-tryptophan; 2-
methyl-L-glutamine; 2-oxobutanoic acid; 2-pyrrolidone carboxylic acid; 3'-(1'-L-histidyl)-
L-tyrosine; 3'-(8alpha-FAD)-L-histidine; 3'-(S-L-cysteinyl)-L-tyrosine; 3',3'',5'-triiodo-L-
thyronine; 3M ! -phospho-L-tyrosine; 3-hydroxy-L-proline; 3-methyl-L-histidine; 3-
methyl-L-lanthionine; 3-phospho-L-histidine; 4'-(L-tryptophan)-L-tryptophyl quinone; 42
N-cysteinyl-glycosylphosphatidylinositolethanolamine; 43 -(T-L-histidyl)-L-tyrosine; 4-
hydroxy-L-arginine; 4-hydroxy-L-lysine; 4-hydroxy-L-proline; 5'-(N6-L-lysine)-L-
topaquinone; 5-hydroxy-L-lysine; 5-methyl-L-arginine; alpha-1-microglobulin-Ig alpha
complex chromophore; bis-L-cysteinyl bis-L-histidino diiron disulfide; bis-L-cysteinyl-L-
N3'-histidino-L-serinyl tetrairon tetrasulfide; chondroitin sulfate D-glucuronyl-D-
galactosyl-D-galactosyl-D-xylosyl-L-serine; D-alanine; D-allo-isoleucine; D-asparagine;
dehydroalanine; dehydrotyrosine; dermatan 4-sulfate D-glucuronyl-D-galactosyl-D-
galactosyl-D-xylosyl-L-serine; D-glucuronyl-N-glycine; dipyrrolylmethanemethyl-L-
cysteine; D-leucine; D-methionine; D-phenylalanine; D-serine; D-tryptophan; glycine
amide; glycine oxazolecarboxylic acid; glycine thiazolecarboxylic acid; heme P450-bis-L-
cysteine-L-tyrosine; heme-bis-L-cysteine; hemediol-L-aspartyl ester-L-glutamyl ester;
hemediol-L-aspartyl ester-L-glutamyl ester-L-methionine sulfonium; heme-L-cysteine;
heme-L-histidine; heparan sulfate D-glucuronyl-D-galactosyl-D-galactosyl-D-xylosyl-L-
serine; heme P450-bis-L-cysteine-L-lysine; hexakis-L-cysteinyl hexairon hexasulfide;
keratan sulfate D-glucuronyl-D-galactosyl-D-galactosyl-D-xylosyl-L-threonine; L
oxoalanine- lactic acid; L phenyllactic acid; 1'-(8alpha-FAD)-L-histidine; L-2',4',5'-
topaquinone; L-3',4'-dihydroxyphenylalanine; L-3',4',5'-trihydroxyphenylalanine; L-4-
bromophenylalanine; L-6'-bromotryptophan; L-alanine amide; L-alanyl imidazolinone
glycine; L-allysine; L-arginine amide; L-asparagine amide; L-aspartic 4-phosphoric
anhydride; L-aspartic acid 1-amide; L-beta-methylthioaspartic acid; L-bromohistidine; L-
citrulline; L-cysteine amide; L-cysteine glutathione disulfide; L-cysteine methyl disulfide;
L-cysteine methyl ester; L-cysteine oxazolecarboxylic acid; L-cysteine
oxazolinecarboxylic acid; L-cysteine persulfide; L-cysteine sulfenic acid; L-cysteine
sulfinic acid; L-cysteine thiazolecarboxylic acid; L-cysteinyl homocitryl molybdenum-
heptairon-nonasulfide; L-cysteinyl imidazolinone glycine; L-cysteinyl molybdopterin; L-
cysteinyl molybdopterin guanine dinucleotide; L-cystine; L-erythro-beta-
hydroxyasparagine; L-erythro-beta-hydroxyaspartic acid; L-gamma-carboxyglutamic acid;
L-glutamic acid 1-amide; L-glutamic acid 5-methyl ester; L-glutamine amide; L-glutamyl
5-glycerylphosphorylethanolamine; L-histidine amide; L-isoglutamyl-polyglutamic acid;
L-isoglutamyl-polyglycine; L-isoleucine amide; L-lanthionine; L-leucine amide; L-lysine
amide; L-lysine thiazolecarboxylic acid; L-lysinoalanine; L-methionine amide; L-
methionine sulfone; L-phenylalanine thiazolecarboxylic acid; L-phenylalanine amide; L-
proline amide; L-selenocysteine; L-selenocysteinyl molybdopterin guanine dinucleotide;
L-serine amide; L-serine thiazolecarboxylic acid; L-seryl imidazolinone glycine; L-T-
bromophenylalanine; L-T-bromophenylalanine; L-threonine amide; L-thyroxine; L-
tryptophan amide; L-tryptophyl quinone; L-tyrosine amide; L-valine amide; meso-
lanthionine; N-(L-glutamyl)-L-tyrosine; N-(L-isoaspartyl)-glycine; N-(L-isoaspartyl)-L-
cysteine; N,N,N-trimethyl-L-alanine; N,N-dimethyl-L-proline; N2-acetyl-L-lysine; N2-
succinyl-L-tryptophan; N4-(ADP-ribosyl)-L-asparagine; N4-glycosyl-L-asparagine; N4-
hydroxymethyl-L-asparagine; N4-methyl-L-asparagine; N5-methyl-L-glutamine; N6-1-
carboxyethyl-L-lysine; N6-(4-amino hydroxybutyl)-L-lysine; N6-(L-isoglutamyl)-L-
lysine; N6-(phospho-5'-adenosine)-L-lysine; N6-(phospho-5'-guanosine)-L-lysine;
N6,N6,N6-trimethyl-L-lysine; N6,N6-dimethyl-L-lysine; N6-acetyl-L-lysine; N6-biotinyl-
L-lysine; N6-carboxy-L-lysine; N6-formyl-L-lysine; N6-glycyl-L-lysine; N6-lipoyl-L-
lysine; N6-methyl-L-lysine; N6-methyl-N6-poly(N-methyl-propylamine)-L-lysine; N6-
mureinyl-L-lysine; N6-myristoyl-L-lysine; N6-palmitoyl-L-lysine; N6-pyridoxal
phosphate-L-lysine; N6-pyruvic acid 2-iminyl-L-lysine; N6-retinal-L-lysine; N-
acetylglycine; N-acetyl-L-glutamine; N-acetyl-L-alanine; N-acetyl-L-aspartic acid; N-
acetyl-L-cysteine; N-acetyl-L-glutamic acid; N-acetyl-L-isoleucine; N-acetyl-L-
methionine; N-acetyl-L-proline; N-acetyl-L-serine; N-acetyl-L-threonine; N-acetyl-L-
tyrosine; N-acetyl-L-valine; N-alanyl-glycosylphosphatidylinositolethanolamine; N-
asparaginyl-glycosylphosphatidylinositolethanolamine; N-aspartyl-
glycosylphosphatidylinositolethanolamine; N-formylglycine; N-formyl-L-methionine; N-
glycyl-glycosylphosphatidylinositolethanolamine; N-L-glutamyl-poly-L-glutamic acid; N-
methylglycine; N-methyl-L-alanine; N-methyl-L-methionine; N-methyl-L-phenylalanine;
N-myristoyl-glycine; N-palmitoyl-L-cysteine; N-pyruvic acid 2-iminyl-L-cysteine; N-
pyruvic acid 2-iminyl-L-valine; N-seryl-glycosylphosphatidylinositolethanolamine; N-
seryl-glycosylsphingolipidinositolethanolamine; O-(ADP-ribosyl)-L-serine; O-(phospho-
5'-adenosine)-L-threonine; O-(phospho-5'-DNA)-L-serine; O-(phospho-5'-DNA)-L-
threonine; O-(phospho-5'-RNA)-L-serine; O-(phosphoribosyl dephospho-coenzyme A)-L-
serine; O-(sn-1-glycerophosphoryl)-L-serine; O4'-(8alpha-FAD)-L-tyrosine; O4'-(phospho-
5'-adenosine)-L-tyrosine; O4'-(phospho-5'-DNA)-L-tyrosine; O4'-(phospho-5'-RNA)-L-
tyrosine; O4'-(phospho-5'-uridine)-L-tyrosine; O4-glycosyl-L-hydroxyproline; O4'-
glycosyl-L-tyrosine; O4'-sulfo-L-tyrosine; O5-glycosyl-L-hydroxylysine; O-glycosyl-L-
serine; O-glycosyl-L-threonine; omega-N-(ADP-ribosyl)-L-arginine; omega-N-omega-N-
dimethyl-L-arginine; omega-N-methyl-L-arginine; omega-N-omega-N-dimethyl-L-
arginine; omega-N-phospho-L-arginine; O-octanoyl-L-serine; O-palmitoyl-L-serine; O-
palmitoyl-L-threonine; O-phospho-L-serine; O-phospho-L-threonine; O-
phosphopantetheine-L-serine; phycoerythrobilin-bis-L-cysteine; phycourobilin-bis-L-
cysteine; pyrroloquinoline quinone; pyruvic acid; S hydroxycinnamyl-L-cysteine; S-(2-
aminovinyl) methyl-D-cysteine; S-(2-aminovinyl)-D-cysteine; S-(6-FMN)-L-cysteine; S-
(8alpha-FAD)-L-cysteine; S-(ADP-ribosyl)-L-cysteine; S-(L-isoglutamyl)-L-cysteine; S-
12-hydroxyfarnesyl-L-cysteine; S-acetyl-L-cysteine; S-diacylglycerol-L-cysteine; S-
diphytanylglycerol diether-L-cysteine; S-farnesyl-L-cysteine; S-geranylgeranyl-L-
cysteine; S-glycosyl-L-cysteine; S-glycyl-L-cysteine; S-methyl-L-cysteine; S-nitrosyl-L-
cysteine; S-palmitoyl-L-cysteine; S-phospho-L-cysteine; S-phycobiliviolin-L-cysteine; S-
phycocyanobilin-L-cysteine; S-phycoerythrobilin-L-cysteine; S-phytochromobilin-L-
cysteine; S-selenyl-L-cysteine; S-sulfo-L-cysteine; tetrakis-L-cysteinyl diiron disulfide;
tetrakis-L-cysteinyl iron; tetrakis-L-cysteinyl tetrairon tetrasulfide; trans-2,3-cis 4-
dihydroxy-L-proline; tris-L-cysteinyl triiron tetrasulfide; tris-L-cysteinyl triiron trisulfide;
tris-L-cysteinyl-L-aspartato tetrairon tetrasulfide; tris-L-cysteinyl-L-cysteine persulfido-
bis-L-glutamato-L-histidino tetrairon disulfide trioxide; tris-L-cysteinyl-L-N3'-histidino
tetrairon tetrasulfide; tris-L-cysteinyl-L-N1'-histidino tetrairon tetrasulfide; and tris-L-
cysteinyl-L-serinyl tetrairon tetrasulfide.

Additional examples of PTMs may be found in web sites such as the Delta Mass
database based on Krishna, R. G. and F. Wold (1998). Posttranslational Modifications.
Proteins - Analysis and Design. R. H. Angeletti. San Diego, Academic Press. 1: 121-206;
Methods in Enzymology, 193, J.A. McCloskey (ed) (1990), pages 647-660; Methods in
Protein Sequence Analysis edited by Kazutomo Imahori and Fumio Sakiyama, Plenum
Press, (1993) "Post-translational modifications of proteins" R.G. Krishna and F. Wold
pages 167-172; "GlycoSuiteDB: a new curated relational database of glycoprotein glycan
structures and their biological sources" Cooper et al. Nucleic Acids Res. 29: 332-335
(2001); "O-GLYCBASE version 4.0: a revised database of O-glycosylated proteins" Gupta
et al. Nucleic Acids Research, 27: 370-372 (1999); and "PhosphoBase, a database of
phosphorylation sites: release 2.0.", Kreegipuu et al. Nucleic Acids Res 27(1):237-239
(1999); see also WO 02/21139A2, the disclosure of which is incorporated herein by
reference in its entirety.

Tumorigenesis is often accompanied by alterations in the post-translational 
modifications of proteins. Thus, in another embodiment, the invention provides 
polypeptides from cancerous cells or tissues that have altered post-translational 
modifications compared to the post-translational modifications of polypeptides from
normal cells or tissues. A number of altered post-translational modifications are known.
One common alteration is a change in phosphorylation state, wherein the polypeptide from 
the cancerous cell or tissue is hyperphosphorylated or hypophosphorylated compared to 
the polypeptide from a normal tissue, or wherein the polypeptide is phosphorylated on 
different residues than the polypeptide from a normal cell. Another common alteration is
a change in glycosylation state, wherein the polypeptide from the cancerous cell or tissue
has more or less glycosylation than the polypeptide from a normal tissue, and/or wherein 
the polypeptide from the cancerous cell or tissue has a different type of glycosylation than 
the polypeptide from a noncancerous cell or tissue. Changes in glycosylation may be 
critical because carbohydrate-protein and carbohydrate-carbohydrate interactions are
important in cancer cell progression, dissemination and invasion. See, e.g., Barchi, Curr.
Pharm. Des. 6: 485-501 (2000), Verma, Cancer Biochem. Biophys. 14: 151-162 (1994)
and Dennis et al., Bioessays 5: 412-421 (1999).

Another post-translational modification that may be altered in cancer cells is
prenylation. Prenylation is the covalent attachment of a hydrophobic prenyl group (either
farnesyl or geranylgeranyl) to a polypeptide. Prenylation is required for localizing a 
protein to a cell membrane and is often required for polypeptide function. For instance,
the Ras superfamily of GTPase signalling proteins must be prenylated for function in a
cell. See, e.g., Prendergast et al., Semin. Cancer Biol. 10: 443-452 (2000) and Khwaja et
al., Lancet 355: 741-744 (2000). 

Other post-translational modifications that may be altered in cancer cells include,
without limitation, polypeptide methylation, acetylation, arginylation or racemization of
amino acid residues. In these cases, the polypeptide from the cancerous cell may exhibit
either increased or decreased amounts of the post-translational modification compared to 
the corresponding polypeptides from noncancerous cells. 

Other polypeptide alterations in cancer cells include abnormal polypeptide
cleavage and aberrant protein-protein interactions. Abnormal polypeptide
cleavage may be cleavage of a polypeptide in a cancerous cell that does not usually occur
in a normal cell, or a lack of cleavage in a cancerous cell, wherein the polypeptide is 
cleaved in a normal cell. Aberrant protein-protein interactions may be either covalent 
cross-linking or non-covalent binding between proteins that do not normally bind to each 
other. Alternatively, in a cancerous cell, a protein may fail to bind to another protein to
which it is bound in a noncancerous cell. Alterations in cleavage or in protein-protein
interactions may be due to over- or underproduction of a polypeptide in a cancerous cell 
compared to that in a normal cell, or may be due to alterations in post-translational 
modifications (see above) of one or more proteins in the cancerous cell. See, e.g., 
Henschen-Edman, Ann. N.Y. Acad. Sci. 936: 580-593 (2001).

Alterations in polypeptide post-translational modifications, as well as changes in
polypeptide cleavage and protein-protein interactions, may be determined by any method
known in the art. For instance, alterations in phosphorylation may be determined by using 
anti-phosphoserine, anti-phosphothreonine or anti-phosphotyrosine antibodies or by amino 
acid analysis. Glycosylation alterations may be determined using antibodies specific for
different sugar residues, by carbohydrate sequencing, or by alterations in the size of the
glycoprotein, which can be determined by, e.g., SDS polyacrylamide gel electrophoresis 
(PAGE). Other alterations of post-translational modifications, such as prenylation, 
racemization, methylation, acetylation and arginylation, may be determined by chemical 
analysis, protein sequencing, amino acid analysis, or by using antibodies specific for the
particular post-translational modifications. Changes in protein-protein interactions and in 
polypeptide cleavage may be analyzed by any method known in the art including, without 
limitation, non-denaturing PAGE (for non-covalent protein-protein interactions), SDS 
PAGE (for covalent protein-protein interactions and protein cleavage), chemical cleavage,
protein sequencing or immunoassays.
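
By way of illustration only, where a mass-based readout is chosen as the method of chemical analysis, the expected change in polypeptide mass for a given number of modifications can be estimated as in the following Python sketch. The use of a mass-based readout, the function name and the example values are assumptions made for the example; the monoisotopic mass shifts listed are the commonly tabulated values for the respective modifications and are rounded to five decimal places.

```python
# Illustrative sketch only: assumes a mass-based readout, which the text
# does not require. Names and example values are hypothetical.
from typing import Dict

# Monoisotopic mass added per modification, in daltons (commonly tabulated values).
PTM_MASS_SHIFT_DA = {
    "phosphorylation": 79.96633,
    "acetylation": 42.01057,
    "methylation": 14.01565,
    "farnesylation": 204.18780,
    "geranylgeranylation": 272.25040,
}


def expected_mass(unmodified_mass_da: float, modifications: Dict[str, int]) -> float:
    """Add the mass shift of each listed modification, times its count."""
    total = unmodified_mass_da
    for name, count in modifications.items():
        if count < 0:
            raise ValueError("modification counts must be non-negative")
        total += PTM_MASS_SHIFT_DA[name] * count
    return total


if __name__ == "__main__":
    # Hypothetical 10,000 Da polypeptide carrying two phosphate groups.
    print(expected_mass(10000.0, {"phosphorylation": 2}))
```

Comparing such an estimate for the polypeptide from a normal cell with the observed mass of the polypeptide from a cancerous cell would suggest, but not by itself establish, a difference in modification state.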

In another embodiment, the invention provides polypeptides that have been post- 
translationally modified. In one embodiment, polypeptides may be modified 
enzymatically or chemically, by addition or removal of a post-translational modification. 
For example, a polypeptide may be glycosylated or deglycosylated enzymatically.
Similarly, polypeptides may be phosphorylated using a purified kinase, such as a MAP
kinase (e.g., p38, ERK, or JNK) or a tyrosine kinase (e.g., Src or erbB2). A polypeptide
may also be modified through synthetic chemistry. Alternatively, one may isolate the
polypeptide of interest from a cell or tissue that expresses the polypeptide with the desired
post-translational modification. In another embodiment, a nucleic acid molecule encoding
the polypeptide of interest is introduced into a host cell that is capable of post- 
translationally modifying the encoded polypeptide in the desired fashion. If the 
polypeptide does not contain a motif for a desired post-translational modification, one may 
alter the post-translational modification by mutating the nucleic acid sequence of a nucleic 
acid molecule encoding the polypeptide so that it contains a site for the desired post-
translational modification. Amino acid sequences that may be post-translationally
modified are known in the art. See, e.g., the programs described above on the website
expasy.org of the world wide web. The nucleic acid molecule may also be introduced into
a host cell that is capable of post-translationally modifying the encoded polypeptide.
Similarly, one may delete sites that are post-translationally modified by either mutating
the nucleic acid sequence so that the encoded polypeptide does not contain the post-
translational modification motif, or by introducing the native nucleic acid molecule into a
host cell that is not capable of post-translationally modifying the encoded polypeptide.
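
By way of illustration only, the deletion of a modification motif by mutation can be planned at the level of the encoded amino acid sequence as in the following Python sketch. The example removes each simplified N-linked glycosylation sequon (Asn, then any residue other than Pro, then Ser or Thr) by substituting glutamine for the asparagine; the choice of motif, the choice of an Asn to Gln substitution, the function name and the example sequence are all assumptions made for the example, and the corresponding codon changes would still have to be made in the nucleic acid molecule.

```python
# Illustrative sketch only: plans substitutions at the protein level; the
# motif and the Asn -> Gln choice are assumptions for the example.
import re

SEQUON = re.compile(r"(?=N[^P][ST])")


def remove_n_glycosylation_sequons(protein: str, replacement: str = "Q") -> str:
    """Substitute the Asn of every candidate sequon with the replacement residue."""
    protein = protein.upper()
    residues = list(protein)
    for match in SEQUON.finditer(protein):
        residues[match.start()] = replacement
    return "".join(residues)


if __name__ == "__main__":
    original = "MNGTANPSNRT"  # hypothetical sequence with sequons at Asn2 and Asn9
    redesigned = remove_n_glycosylation_sequons(original)
    print(original, "->", redesigned)
```

The asparagine followed by proline is left unchanged in this example because it does not form part of a candidate sequon.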

It will be appreciated, as is well known and as noted above, that polypeptides are
not always entirely linear. For instance, polypeptides may be branched as a result of
ubiquitination, and they may be circular, with or without branching, generally as a result
of posttranslation events, including natural processing events and events brought about by
human manipulation which do not occur naturally. Circular, branched and branched
circular polypeptides may be synthesized by non-translation natural processes and by
entirely synthetic methods, as well. Modifications can occur anywhere in a polypeptide, 
including the peptide backbone, the amino acid side-chains and the amino or carboxyl 
termini. In fact, blockage of the amino or carboxyl group in a polypeptide, or both, by a
covalent modification, is common in naturally occurring and synthetic polypeptides and
such modifications may be present in polypeptides of the present invention, as well. For 
instance, the amino terminal residue of polypeptides made in E. coli, prior to proteolytic 
processing, almost invariably will be N-formylmethionine. 

Useful post-synthetic (and post-translational) modifications include conjugation to
detectable labels, such as fluorophores. A wide variety of amine-reactive and thiol-
reactive fluorophore derivatives have been synthesized that react under nondenaturing 
conditions with N-terminal amino groups and epsilon amino groups of lysine residues, on 
the one hand, and with free thiol groups of cysteine residues, on the other. 

Kits are available commercially that permit conjugation of proteins to a variety of
amine-reactive or thiol-reactive fluorophores: Molecular Probes, Inc. (Eugene, OR, USA),
e.g., offers kits for conjugating proteins to Alexa Fluor 350, Alexa Fluor 430,
Fluorescein-EX, Alexa Fluor 488, Oregon Green 488, Alexa Fluor 532, Alexa Fluor 546,
Alexa Fluor 568, Alexa Fluor 594, and Texas Red-X.

A wide variety of other amine-reactive and thiol-reactive fluorophores are
available commercially (Molecular Probes, Inc., Eugene, OR, USA), including Alexa
Fluor® 350, Alexa Fluor® 488, Alexa Fluor® 532, Alexa Fluor® 546, Alexa Fluor® 568,
Alexa Fluor® 594, Alexa Fluor® 647 (monoclonal antibody labeling kits available from
Molecular Probes, Inc., Eugene, OR, USA), BODIPY dyes, such as BODIPY 493/503,
BODIPY FL, BODIPY R6G, BODIPY 530/550, BODIPY TMR, BODIPY 558/568,
BODIPY 564/570, BODIPY 576/589, BODIPY 581/591, BODIPY
TR, BODIPY 630/650, BODIPY 650/665, Cascade Blue, Cascade Yellow, Dansyl,
lissamine rhodamine B, Marina Blue, Oregon Green 488, Oregon Green 514, Pacific Blue,
rhodamine 6G, rhodamine green, rhodamine red, tetramethylrhodamine, Texas Red
(available from Molecular Probes, Inc., Eugene, OR, USA).

The polypeptides of the present invention can also be conjugated to fluorophores,
other proteins, and other macromolecules, using bifunctional linking reagents. Common
homobifunctional reagents include, e.g., APG, AEDP, BASED, BMB, BMDB, BMH,
BMOE, BM[PEO]3, BM[PEO]4, BS3, BSOCOES, DFDNB, DMA, DMP, DMS, DPDPB,
DSG, DSP (Lomant's Reagent), DSS, DST, DTBP, DTME, DTSSP, EGS, HBVS,
Sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS (all available from Pierce, Rockford, IL, USA);
common heterobifunctional cross-linkers include ABH, AMAS, ANB-NOS, APDP,
ASBA, BMPA, BMPH, BMPS, EDC, EMCA, EMCH, EMCS, KMUA, KMUH, GMBS,
LC-SMCC, LC-SPDP, MBS, M2C2H, MPBH, MSA, NHS-ASA, PDPH, PMPI, SADP,
SAED, SAND, SANPAH, SASD, SATP, SBAP, SFAD, SIA, SIAB, SMCC, SMPB,
SMPH, SMPT, SPDP, Sulfo-EMCS, Sulfo-GMBS, Sulfo-HSAB, Sulfo-KMUS,
Sulfo-LC-SPDP, Sulfo-MBS, Sulfo-NHS-LC-ASA, Sulfo-SADP, Sulfo-SANPAH,
Sulfo-SIAB, Sulfo-SMCC, Sulfo-SMPB, Sulfo-LC-SMPT, SVSB, TFCS (all available
from Pierce, Rockford, IL, USA).

Polypeptides of the present invention, including full length polypeptides, 
fragments and fusion proteins, can be conjugated, using such cross-linking reagents, to 
fluorophores that are not amine- or thiol-reactive. Other labels that usefully can be 
conjugated to polypeptides of the present invention include radioactive labels,
echosonographic contrast reagents, and MRI contrast agents.

Polypeptides of the present invention, including full length polypeptides, 
fragments and fusion proteins, can also usefully be conjugated using cross-linking agents 
to carrier proteins, such as KLH, bovine thyroglobulin, and even bovine serum albumin 
(BSA), to increase immunogenicity for raising anti-CSP antibodies. 

Polypeptides of the present invention, including full length polypeptides,
fragments and fusion proteins, can also usefully be conjugated to polyethylene glycol
(PEG); PEGylation increases the serum half life of proteins administered intravenously for
replacement therapy. Delgado et al., Crit. Rev. Ther. Drug Carrier Syst. 9(3-4): 249-304
(1992); Scott et al., Curr. Pharm. Des. 4(6): 423-38 (1998); DeSantis et al., Curr. Opin.
Biotechnol. 10(4): 324-30 (1999). PEG monomers can be attached to the protein directly
or through a linker, with PEGylation using PEG monomers activated with tresyl chloride 
(2,2,2-trifluoroethanesulphonyl chloride) permitting direct attachment under mild 
conditions. 

Polypeptides of the present invention are also inclusive of analogs of a polypeptide
encoded by a nucleic acid molecule according to the instant invention. In a preferred
embodiment, this polypeptide is a CSP. In a more preferred embodiment, this polypeptide
is derived from a polypeptide having part or all of the amino acid sequence of SEQ ID 
NO: 95-248. Also preferred is an analog polypeptide comprising one or more 
substitutions of non-natural amino acids or non-native inter-residue bonds compared to the
naturally occurring polypeptide. In one embodiment, the analog is structurally similar to a
CSP, but one or more peptide linkages is replaced by a linkage selected from the group
consisting of --CH2NH--, --CH2S--, --CH2-CH2--, --CH=CH-- (cis and trans), --COCH2--,
--CH(OH)CH2-- and --CH2SO--. In another embodiment, the analog comprises
substitution of one or more amino acids of a CSP with a D-amino acid of the same type or
other non-natural amino acid in order to generate more stable peptides. D-amino acids can 
readily be incorporated during chemical peptide synthesis: peptides assembled from 
D-amino acids are more resistant to proteolytic attack; incorporation of D-amino acids can 

10 also be used to confer specific three-dimensional conformations on the peptide. Other 
amino acid analogues commonly added during chemical synthesis include ornithine, 
norleucine, phosphorylated amino acids (typically phosphoserine, phosphothreonine, 
phosphotyrosine), L-malonyltyrosine, a non-hydrolyzable analog of phosphotyrosine (see, 
e.g., Kole et al. y Biochenu Biophys. Res. Com. 209: 817-821 (1995)), and various 

1 5 halogenated phenylalanine derivatives. 

Non-natural amino acids can be incorporated during solid phase chemical synthesis
or by recombinant techniques, although the former is typically more common. Solid
phase chemical synthesis of peptides is well established in the art. Procedures are
described, inter alia, in Chan et al. (eds.), Fmoc Solid Phase Peptide Synthesis: A
Practical Approach (Practical Approach Series), Oxford Univ. Press (March 2000); Jones,
Amino Acid and Peptide Synthesis (Oxford Chemistry Primers, No 7), Oxford Univ. Press
(1992); and Bodanszky, Principles of Peptide Synthesis (Springer Laboratory), Springer
Verlag (1993).

Amino acid analogues having detectable labels are also usefully incorporated
during synthesis to provide derivatives and analogs. Biotin, for example, can be added
using biotinoyl-(9-fluorenylmethoxycarbonyl)-L-lysine (FMOC biocytin) (Molecular
Probes, Eugene, OR, USA). Biotin can also be added enzymatically by incorporation into
a fusion protein of an E. coli BirA substrate peptide. The FMOC and tBOC derivatives of
dabcyl-L-lysine (Molecular Probes, Inc., Eugene, OR, USA) can be used to incorporate
the dabcyl chromophore at selected sites in the peptide sequence during synthesis. The
aminonaphthalene derivative EDANS, the most common fluorophore for pairing with the
dabcyl quencher in fluorescence resonance energy transfer (FRET) systems, can be
introduced during automated synthesis of peptides by using EDANS-FMOC-L-glutamic
acid or the corresponding tBOC derivative (both from Molecular Probes, Inc., Eugene,
OR, USA). Tetramethylrhodamine fluorophores can be incorporated during automated
FMOC synthesis of peptides using (FMOC)-TMR-L-lysine (Molecular Probes, Inc.,
Eugene, OR, USA).

Other useful amino acid analogues that can be incorporated during chemical
synthesis include aspartic acid, glutamic acid, lysine, and tyrosine analogues having allyl
side-chain protection (Applied Biosystems, Inc., Foster City, CA, USA); the allyl side
chain permits synthesis of cyclic, branched-chain, sulfonated, glycosylated, and
phosphorylated peptides.

A large number of other FMOC-protected non-natural amino acid analogues
capable of incorporation during chemical synthesis are available commercially, including,
e.g., Fmoc-2-aminobicyclo[2.2.1]heptane-2-carboxylic acid,
Fmoc-3-endo-aminobicyclo[2.2.1]heptane-2-endo-carboxylic acid,
Fmoc-3-exo-aminobicyclo[2.2.1]heptane-2-exo-carboxylic acid,
Fmoc-3-endo-amino-bicyclo[2.2.1]hept-5-ene-2-endo-carboxylic acid,
Fmoc-3-exo-amino-bicyclo[2.2.1]hept-5-ene-2-exo-carboxylic acid,
Fmoc-cis-2-amino-1-cyclohexanecarboxylic acid,
Fmoc-trans-2-amino-1-cyclohexanecarboxylic acid,
Fmoc-1-amino-1-cyclopentanecarboxylic acid,
Fmoc-cis-2-amino-1-cyclopentanecarboxylic acid,
Fmoc-1-amino-1-cyclopropanecarboxylic acid, Fmoc-D-2-amino-4-(ethylthio)butyric acid,
Fmoc-L-2-amino-4-(ethylthio)butyric acid, Fmoc-L-buthionine, Fmoc-S-methyl-L-Cysteine,
Fmoc-2-aminobenzoic acid (anthranilic acid), Fmoc-3-aminobenzoic acid,
Fmoc-4-aminobenzoic acid, Fmoc-2-aminobenzophenone-2'-carboxylic acid,
Fmoc-N-(4-aminobenzoyl)-β-alanine, Fmoc-2-amino-4,5-dimethoxybenzoic acid,
Fmoc-4-aminohippuric acid, Fmoc-2-amino-3-hydroxybenzoic acid,
Fmoc-2-amino-5-hydroxybenzoic acid, Fmoc-3-amino-4-hydroxybenzoic acid,
Fmoc-4-amino-3-hydroxybenzoic acid, Fmoc-4-amino-2-hydroxybenzoic acid,
Fmoc-5-amino-2-hydroxybenzoic acid, Fmoc-2-amino-3-methoxybenzoic acid,
Fmoc-4-amino-3-methoxybenzoic acid, Fmoc-2-amino-3-methylbenzoic acid,
Fmoc-2-amino-5-methylbenzoic acid, Fmoc-2-amino-6-methylbenzoic acid,
Fmoc-3-amino-2-methylbenzoic acid, Fmoc-3-amino-4-methylbenzoic acid,
Fmoc-4-amino-3-methylbenzoic acid, Fmoc-3-amino-2-naphthoic acid,
Fmoc-D,L-3-amino-3-phenylpropionic acid, Fmoc-L-Methyldopa,
Fmoc-2-amino-4,6-dimethyl-3-pyridinecarboxylic acid,
Fmoc-D,L-amino-2-thiophenacetic acid, Fmoc-4-(carboxymethyl)piperazine,
Fmoc-4-carboxypiperazine, Fmoc-4-(carboxymethyl)homopiperazine,
Fmoc-4-phenyl-4-piperidinecarboxylic acid,
Fmoc-L-1,2,3,4-tetrahydronorharman-3-carboxylic acid,
Fmoc-L-thiazolidine-4-carboxylic acid, all
available from The Peptide Laboratory (Richmond, CA, USA).

Non-natural residues can also be added biosynthetically by engineering a
suppressor tRNA, typically one that recognizes the UAG stop codon, by chemical
aminoacylation with the desired unnatural amino acid. Conventional site-directed
mutagenesis is used to introduce the chosen stop codon UAG at the site of interest in the
protein gene. When the acylated suppressor tRNA and the mutant gene are combined in
an in vitro transcription/translation system, the unnatural amino acid is incorporated in
response to the UAG codon to give a protein containing that amino acid at the specified
position. Liu et al., Proc. Natl. Acad. Sci. USA 96(9): 4780-5 (1999); Wang et al., Science
292(5516): 498-500 (2001).
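
By way of non-limiting illustration, the sequence-design step of the approach just described, namely placing the stop codon UAG (TAG in the DNA coding strand) at the site of interest, can be sketched in Python. The function name introduce_amber_codon and the short open reading frame are hypothetical and chosen only for illustration; they are not sequences or tools of the invention, and the sketch does not model the mutagenesis chemistry itself.

```python
def introduce_amber_codon(cds: str, residue: int) -> str:
    """Return the coding sequence with the codon for the given 1-based
    residue replaced by TAG (the DNA form of the UAG amber stop codon)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    n_codons = len(cds) // 3
    if not 1 <= residue <= n_codons:
        raise ValueError("residue position lies outside the coding sequence")
    start = 3 * (residue - 1)  # 0-based index of the first base of the codon
    return cds[:start] + "TAG" + cds[start + 3:]


# Hypothetical 5-codon open reading frame (Met-Ala-Lys-Gly-stop).
example_cds = "ATGGCCAAAGGTTAA"
mutant_cds = introduce_amber_codon(example_cds, 3)  # replace the Lys codon
print(mutant_cds)
```

In such a sketch the mutant coding sequence would then be the template supplied, together with the acylated suppressor tRNA, to the in vitro transcription/translation system.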

Fusion Proteins

Another aspect of the present invention relates to the fusion of a polypeptide of the
present invention to heterologous polypeptides. In a preferred embodiment, the
polypeptide of the present invention is a CSP. In a more preferred embodiment, the
polypeptide of the present invention that is fused to a heterologous polypeptide
comprises part or all of the amino acid sequence of SEQ ID NO: 95-248, or is a mutein,
homologous polypeptide, analog or derivative thereof. In an even more preferred
embodiment, the fusion protein is encoded by a nucleic acid molecule comprising all or
part of the nucleic acid sequence of SEQ ID NO: 1-94, or comprises all or part of a nucleic
acid sequence that selectively hybridizes or is homologous to a nucleic acid molecule
comprising a nucleic acid sequence of SEQ ID NO: 1-94.

The fusion proteins of the present invention will include at least one fragment of a
polypeptide of the present invention, which fragment is at least 6, typically at least 8, often
at least 15, and usefully at least 16, 17, 18, 19, or 20 amino acids long. The fragment of
the polypeptide of the present invention to be included in the fusion can usefully be at
least 25 amino acids long, at least 50 amino acids long, and can be at least 75, 100, or
even 150 amino acids long. Fusions that include the entirety of a polypeptide of the
present invention have particular utility.

The heterologous polypeptide included within the fusion protein of the present
invention is at least 6 amino acids in length, often at least 8 amino acids in length, and
preferably at least 15, 20, or 25 amino acids in length. Fusions that include larger
polypeptides, such as the IgG Fc region, and even entire proteins (such as GFP
chromophore-containing proteins) are particularly useful.
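
By way of non-limiting illustration, the minimum fragment length recited above for the polypeptide portion of a fusion can be sketched as a simple enumeration of contiguous fragments. The helper name contiguous_fragments and the 10-residue placeholder sequence are hypothetical and are not drawn from any SEQ ID NO of the invention.

```python
def contiguous_fragments(seq: str, min_len: int = 6):
    """Yield every contiguous fragment of seq at least min_len residues long,
    shortest fragments first."""
    for length in range(min_len, len(seq) + 1):
        for start in range(len(seq) - length + 1):
            yield seq[start:start + length]


# Hypothetical 10-residue placeholder sequence in one-letter code.
placeholder = "MKTAYIAKQR"
fragments = list(contiguous_fragments(placeholder, min_len=6))
print(len(fragments), fragments[0], fragments[-1])
```

Raising min_len to 8, 15, 25 or more narrows the enumeration to the longer fragments described above; the final fragment yielded is always the full-length sequence.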

As described above in the description of vectors and expression vectors of the
present invention, which discussion is incorporated here by reference in its entirety,
heterologous polypeptides to be included in the fusion proteins of the present invention
can usefully include those designed to facilitate purification and/or visualization of
recombinantly-expressed proteins. See, e.g., Ausubel, Chapter 16, (1992), supra.
Although purification tags can also be incorporated into fusions that are chemically
synthesized, chemical synthesis typically provides sufficient purity that further
purification by HPLC suffices; however, visualization tags as above described retain their
utility even when the protein is produced by chemical synthesis, and when so included
render the fusion proteins of the present invention useful as directly detectable markers of
the presence of a polypeptide of the invention.

As also discussed above, heterologous polypeptides to be included in the fusion
proteins of the present invention can usefully include those that facilitate secretion of
recombinantly expressed proteins into the periplasmic space or extracellular milieu for
prokaryotic hosts or into the culture medium for eukaryotic cells through incorporation of
secretion signals and/or leader sequences. For example, a His6 tagged protein can be
purified on a Ni affinity column and a GST fusion protein can be purified on a glutathione
affinity column. Similarly, a fusion protein comprising the Fc domain of IgG can be
purified on a Protein A or Protein G column and a fusion protein comprising an epitope
tag such as myc can be purified using an immunoaffinity column containing an anti-c-myc
antibody. It is preferable that the epitope tag be separated from the protein encoded by the
essential gene by an enzymatic cleavage site that can be cleaved after purification. See
also the discussion of nucleic acid molecules encoding fusion proteins that may be
expressed on the surface of a cell.

Other useful fusion proteins of the present invention include those that permit use
of the polypeptide of the present invention as bait in a yeast two-hybrid system. See
Bartel et al. (eds.), The Yeast Two-Hybrid System, Oxford University Press (1997); Zhu
et al., Yeast Hybrid Technologies, Eaton Publishing (2000); Fields et al., Trends Genet.
10(8): 286-92 (1994); Mendelsohn et al., Curr. Opin. Biotechnol. 5(5): 482-6 (1994);
Luban et al., Curr. Opin. Biotechnol. 6(1): 59-64 (1995); Allen et al., Trends Biochem.
Sci. 20(12): 511-6 (1995); Drees, Curr. Opin. Chem. Biol. 3(1): 64-70 (1999); Topcu et
al., Pharm. Res. 17(9): 1049-55 (2000); Fashena et al., Gene 250(1-2): 1-14 (2000); Colas
et al., Nature 380, 548-550 (1996); Norman, T. et al., Science 285, 591-595 (1999);
Fabbrizio et al., Oncogene 18, 4357-4363 (1999); Xu et al., Proc. Natl. Acad. Sci. USA
94, 12473-12478 (1997); Yang et al., Nuc. Acids Res. 23, 1152-1156 (1995); Kolonin et
al., Proc. Natl. Acad. Sci. USA 95, 14266-14271 (1998); Cohen et al., Proc. Natl. Acad.
Sci. USA 95, 14272-14277 (1998); Uetz et al., Nature 403, 623-627 (2000); Ito et al.,
Proc. Natl. Acad. Sci. USA 98, 4569-4574 (2001). Typically, such fusion is to either
E. coli LexA or yeast GAL4 DNA binding domains. Related bait plasmids are available
that express the bait fused to a nuclear localization signal.

Other useful fusion proteins include those that permit display of the encoded
polypeptide on the surface of a phage or cell, fusions to intrinsically fluorescent proteins,
such as green fluorescent protein (GFP), and fusions to the IgG Fc region, as described
above.

The polypeptides of the present invention can also usefully be fused to protein
toxins, such as Pseudomonas exotoxin A, diphtheria toxin, shiga toxin A, anthrax toxin
lethal factor, or ricin, in order to effect ablation of cells that bind or take up the proteins of
the present invention.

Fusion partners include, inter alia, myc, hemagglutinin (HA), GST,
immunoglobulins, β-galactosidase, biotin, trpE, protein A, β-lactamase, α-amylase,
maltose binding protein, alcohol dehydrogenase, polyhistidine (for example, six histidine
at the amino and/or carboxyl terminus of the polypeptide), lacZ, green fluorescent protein
(GFP), yeast α mating factor, GAL4 transcription activation or DNA binding domain,
luciferase, and serum proteins such as ovalbumin, albumin and the constant domain of
IgG. See, e.g., Ausubel (1992), supra and Ausubel (1999), supra. Fusion proteins may
also contain sites for specific enzymatic cleavage, such as a site that is recognized by
enzymes such as Factor XIII, trypsin, pepsin, or any other enzyme known in the art.
Fusion proteins will typically be made by recombinant nucleic acid methods, as
described above, chemically synthesized using techniques well known in the art (e.g., a
Merrifield synthesis), or produced by chemical cross-linking.

Another advantage of fusion proteins is that the epitope tag can be used to bind the
fusion protein to a plate or column through an affinity linkage for screening binding
proteins or other molecules that bind to the CSP.

As further described below, the polypeptides of the present invention can readily
be used as specific immunogens to raise antibodies that specifically recognize
polypeptides of the present invention including CSPs and their allelic variants and
homologues. The antibodies, in turn, can be used, inter alia, specifically to assay for the
polypeptides of the present invention, particularly CSPs, e.g. by ELISA for detection of
protein in fluid samples, such as serum, by immunohistochemistry or laser scanning
cytometry, for detection of protein in tissue samples, or by flow cytometry, for detection
of intracellular protein in cell suspensions, for specific antibody-mediated isolation and/or
purification of CSPs, as for example by immunoprecipitation, and for use as specific
agonists or antagonists of CSPs.

One may determine whether polypeptides of the present invention including CSPs,
muteins, homologous proteins or allelic variants or fusion proteins of the present invention
are functional by methods known in the art. For instance, residues that are tolerant of
change while retaining function can be identified by altering the polypeptide at known
residues using methods known in the art, such as alanine scanning mutagenesis,
Cunningham et al., Science 244(4908): 1081-5 (1989); transposon linker scanning
mutagenesis, Chen et al., Gene 263(1-2): 39-48 (2001); combinations of homolog- and
alanine-scanning mutagenesis, Jin et al., J. Mol. Biol. 226(3): 851-65 (1992); and
combinatorial alanine scanning, Weiss et al., Proc. Natl. Acad. Sci. USA 97(16): 8950-4
(2000), followed by functional assay. Transposon linker scanning kits are available
commercially (New England Biolabs, Beverly, MA, USA, catalog no. E7-102S;
EZ::TN™ In-Frame Linker Insertion Kit, catalogue no. EZI04KN, Epicentre
Technologies Corporation, Madison, WI, USA).
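
By way of non-limiting illustration, the design of a single-site alanine scanning panel of the kind referred to above can be sketched in Python. The function name alanine_scan and the four-residue placeholder sequence are hypothetical; the sketch only enumerates the variant sequences to be made and does not model the subsequent functional assay.

```python
def alanine_scan(seq: str):
    """Return (label, variant) pairs in which each non-alanine residue of seq
    is replaced, one position at a time, by alanine (one-letter code A)."""
    variants = []
    for i, residue in enumerate(seq):
        if residue == "A":
            continue  # position is already alanine, so no variant is made
        label = f"{residue}{i + 1}A"  # e.g. K2A: Lys at position 2 to Ala
        variants.append((label, seq[:i] + "A" + seq[i + 1:]))
    return variants


# Hypothetical four-residue placeholder sequence.
for label, variant in alanine_scan("MKAL"):
    print(label, variant)
```

Each variant would then be expressed and tested in the functional assay; positions whose alanine variant retains function are candidates for residues tolerant of change.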

Purification of the polypeptides or fusion proteins of the present invention is well
known and within the skill of one having ordinary skill in the art. See, e.g., Scopes,
Protein Purification, 2d ed. (1987). Purification of recombinantly expressed polypeptides
is described above. Purification of chemically-synthesized peptides can readily be
effected, e.g., by HPLC.

Accordingly, it is an aspect of the present invention to provide the isolated
polypeptides or fusion proteins of the present invention in pure or substantially pure form
in the presence or absence of a stabilizing agent. Stabilizing agents include both
proteinaceous and non-proteinaceous material and are well known in the art. Stabilizing
agents, such as albumin and polyethylene glycol (PEG), are known and are commercially
available.

Although high levels of purity are preferred when the isolated polypeptide or
fusion protein of the present invention is used as a therapeutic agent, such as in vaccines
and replacement therapy, the isolated polypeptides of the present invention are also useful
at lower purity. For example, partially purified polypeptides of the present invention can
be used as immunogens to raise antibodies in laboratory animals.

In a preferred embodiment, the purified and substantially purified polypeptides of
the present invention are in compositions that lack detectable ampholytes, acrylamide
monomers, bis-acrylamide monomers, and polyacrylamide.

The polypeptides or fusion proteins of the present invention can usefully be
attached to a substrate. The substrate can be porous or solid, planar or non-planar; the
bond can be covalent or noncovalent. For example, the peptides of the invention may be
stabilized by covalent linkage to albumin. See U.S. Patent No. 5,876,969, the contents of
which are hereby incorporated in their entirety.

The polypeptides or fusion proteins of the present invention can also be usefully
bound to a porous substrate, commonly a membrane, typically comprising nitrocellulose,
polyvinylidene fluoride (PVDF), or cationically derivatized, hydrophilic PVDF; so bound,
the polypeptides or fusion proteins of the present invention can be used to detect and
quantify antibodies, e.g. in serum, that bind specifically to the immobilized polypeptide or
fusion protein of the present invention.

As another example, the polypeptides or fusion proteins of the present invention
can usefully be bound to a substantially nonporous substrate, such as plastic, to detect and
quantify antibodies, e.g. in serum, that bind specifically to the immobilized protein of the
present invention. Such plastics include polymethylacrylic, polyethylene, polypropylene,
polyacrylate, polymethylmethacrylate, polyvinylchloride, polytetrafluoroethylene,
polystyrene, polycarbonate, polyacetal, polysulfone, celluloseacetate, cellulosenitrate,
nitrocellulose, or mixtures thereof; when the assay is performed in a standard microtiter
dish, the plastic is typically polystyrene.

The polypeptides and fusion proteins of the present invention can also be attached
to a substrate suitable for use as a surface enhanced laser desorption ionization source; so
attached, the polypeptide or fusion protein of the present invention is useful for binding
and then detecting secondary proteins that bind with sufficient affinity or avidity to the
surface-bound polypeptide or fusion protein to indicate biologic interaction therebetween.
The polypeptides or fusion proteins of the present invention can also be attached to a
substrate suitable for use in surface plasmon resonance detection; so attached, the
polypeptide or fusion protein of the present invention is useful for binding and then
detecting secondary proteins that bind with sufficient affinity or avidity to the
surface-bound polypeptide or fusion protein to indicate biological interaction therebetween.

Alternative Transcripts 

In another aspect, the present invention provides splice variants of genes and
proteins encoded thereby. A novel splice variant that encodes an amino acid sequence
with a novel region can be targeted for the generation of reagents for
use in detection and/or treatment of cancer. The novel amino acid sequence may lead to a
unique protein structure, protein subcellular localization, biochemical processing or
function of the splice variant. This information can be used to directly or indirectly
facilitate the generation of additional or novel therapeutics or diagnostics. The nucleotide
sequence in this novel splice variant can be used as a nucleic acid probe for the diagnosis
and/or treatment of cancer.

Specifically, the newly identified sequences may enable the production of new
antibodies or compounds directed against the novel region for use as a therapeutic or
diagnostic. Alternatively, the newly identified sequences may alter the biochemical or
biological properties of the encoded protein in such a way as to enable the generation of
improved or different therapeutics targeting this protein.

Antibodies 

In another aspect, the invention provides antibodies, including fragments and
derivatives thereof, that bind specifically to polypeptides encoded by the nucleic acid
molecules of the invention. In a preferred embodiment, the antibodies are specific for a
polypeptide that is a CSP, or a fragment, mutein, derivative, analog or fusion protein
thereof. In a more preferred embodiment, the antibodies are specific for a polypeptide that
comprises SEQ ID NO: 95-248, or a fragment, mutein, derivative, analog or fusion protein
thereof.

The antibodies of the present invention can be specific for linear epitopes,
discontinuous epitopes, or conformational epitopes of such proteins or protein fragments,
either as present on the protein in its native conformation or, in some cases, as present on
the proteins as denatured, as, e.g., by solubilization in SDS. New epitopes may also be
due to a difference in post translational modifications (PTMs) in disease versus normal
tissue. For example, a particular site on a CSP may be glycosylated in cancerous cells, but
not glycosylated in normal cells or vice versa. In addition, alternative splice forms of a
CSP may be indicative of cancer. Differential degradation of the C or N-terminus of a
CSP may also be a marker or target for anticancer therapy. For example, a CSP may be
N-terminal degraded in cancer cells exposing new epitopes to antibodies which may
selectively bind for diagnostic or therapeutic uses.

As is well known in the art, the degree to which an antibody can discriminate
among molecular species in a mixture will depend, in part, upon the conformational
relatedness of the species in the mixture; typically, the antibodies of the present invention
will discriminate over adventitious binding to non-CSP polypeptides by at least two-fold,
more typically by at least 5-fold, typically by more than 10-fold, 25-fold, 50-fold, 75-fold,
and often by more than 100-fold, and on occasion by more than 500-fold or 1000-fold.
When used to detect the proteins or protein fragments of the present invention, the
antibody of the present invention is sufficiently specific when it can be used to determine
the presence of the polypeptide of the present invention in samples derived from human
colon.

Typically, the affinity or avidity of an antibody (or antibody multimer, as in the
case of an IgM pentamer) of the present invention for a protein or protein fragment of the
present invention will be at least about 1 x 10⁻⁶ molar (M), typically at least about
5 x 10⁻⁷ M, 1 x 10⁻⁷ M, with affinities and avidities of at least 1 x 10⁻⁸ M, 5 x 10⁻⁹ M,
1 x 10⁻¹⁰ M and up to 1 x 10⁻¹³ M proving especially useful.
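
By way of non-limiting illustration, the practical effect of the affinity values recited above can be sketched numerically. The sketch assumes, for illustration only, that each recited molar value is read as an equilibrium dissociation constant (Kd), that binding is a simple 1:1 equilibrium, and that the free antibody concentration is approximately equal to the total antibody concentration; the function name fraction_bound and the 10 nM antibody concentration are hypothetical.

```python
def fraction_bound(free_antibody_molar: float, kd_molar: float) -> float:
    """Fraction of antigen occupied at equilibrium for simple 1:1 binding:
    [Ab] / ([Ab] + Kd), with both quantities in molar units."""
    if free_antibody_molar < 0 or kd_molar <= 0:
        raise ValueError("concentration must be >= 0 and Kd must be > 0")
    return free_antibody_molar / (free_antibody_molar + kd_molar)


# Hypothetical assay condition: 10 nM free antibody.
antibody = 1e-8
for kd in (1e-6, 1e-8, 1e-10):
    print(f"Kd = {kd:.0e} M -> fraction bound = {fraction_bound(antibody, kd):.3f}")
```

Under these assumptions, an antibody with a Kd equal to its working concentration occupies half of the available antigen, whereas a Kd one hundred-fold lower gives nearly complete occupancy, which is consistent with the lower molar values being described as especially useful.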

The antibodies of the present invention can be naturally occurring forms, such as 
IgG, IgM, IgD, IgE, IgY, and IgA, from any avian, reptilian, or mammalian species. 

Human antibodies can, but will infrequently, be drawn directly from human donors
or human cells. In such case, antibodies to the polypeptides of the present invention will
typically have resulted from fortuitous immunization, such as autoimmune immunization,
with the polypeptide of the present invention. Such antibodies will typically, but will not
invariably, be polyclonal. In addition, individual polyclonal antibodies may be isolated
and cloned to generate monoclonals.

Human antibodies are more frequently obtained using transgenic animals that
express human immunoglobulin genes, which transgenic animals can be affirmatively
immunized with the protein immunogen of the present invention. Human Ig-transgenic
mice capable of producing human antibodies and methods of producing human antibodies
therefrom upon specific immunization are described, inter alia, in U.S. Patent Nos.
6,162,963; 6,150,584; 6,114,598; 6,075,181; 5,939,598; 5,877,397; 5,874,299; 5,814,318;
5,789,650; 5,770,429; 5,661,016; 5,633,425; 5,625,126; 5,569,825; 5,545,807; 5,545,806,
and 5,591,669, the disclosures of which are incorporated herein by reference in their
entireties. Such antibodies are typically monoclonal, and are typically produced using
techniques developed for production of murine antibodies.

Human antibodies are particularly useful, and often preferred, when the antibodies
of the present invention are to be administered to human beings as in vivo diagnostic or
therapeutic agents, since recipient immune response to the administered antibody will
often be substantially less than that occasioned by administration of an antibody derived
from another species, such as mouse.

IgG, IgM, IgD, IgE, IgY, and IgA antibodies of the present invention are also
usefully obtained from other species, including mammals such as rodents (typically
mouse, but also rat, guinea pig, and hamster), lagomorphs (typically rabbits), and also
larger mammals, such as sheep, goats, cows, and horses; or egg laying birds or reptiles
such as chickens or alligators. In such cases, as with the transgenic
human-antibody-producing non-human mammals, fortuitous immunization is not required,
and the non-human mammal is typically affirmatively immunized, according to standard
immunization protocols, with the polypeptide of the present invention. One form of avian
antibodies may be generated using techniques described in WO 00/29444, published
25 May 2000, which is herein incorporated by reference in its entirety.

As discussed above, virtually all fragments of 8 or more contiguous amino acids of
a polypeptide of the present invention can be used effectively as immunogens when
conjugated to a carrier, typically a protein such as bovine thyroglobulin, keyhole limpet
hemocyanin, or bovine serum albumin, conveniently using a bifunctional linker such as
those described elsewhere above, which discussion is incorporated by reference here.

Immunogenicity can also be conferred by fusion of the polypeptides of the present
invention to other moieties. For example, polypeptides of the present invention can be
produced by solid phase synthesis on a branched polylysine core matrix; these multiple
antigenic peptides (MAPs) provide high purity, increased avidity, accurate chemical
definition and improved safety in vaccine development. Tam et al., Proc. Natl. Acad. Sci.
USA 85: 5409-5413 (1988); Posnett et al., J. Biol. Chem. 263: 1719-1725 (1988).

Protocols for immunizing non-human mammals or avian species are
well-established in the art. See Harlow et al. (eds.), Using Antibodies: A Laboratory Manual,
Cold Spring Harbor Laboratory (1998); Coligan et al. (eds.), Current Protocols in
Immunology, John Wiley & Sons, Inc. (2001); Zola, Monoclonal Antibodies: Preparation
and Use of Monoclonal Antibodies and Engineered Antibody Derivatives (Basics: From
Background to Bench), Springer Verlag (2000); Gross M, Speck J., Dtsch. Tierarztl.
Wochenschr. 103: 417-422 (1996). Immunization protocols often include multiple
immunizations, either with or without adjuvants such as Freund's complete adjuvant and
Freund's incomplete adjuvant, and may include naked DNA immunization. Moss, Semin.
Immunol. 2: 317-327 (1990).

Antibodies from non-human mammals and avian species can be polyclonal or
monoclonal, with polyclonal antibodies having certain advantages in
immunohistochemical detection of the polypeptides of the present invention and
monoclonal antibodies having advantages in identifying and distinguishing particular
epitopes of the polypeptides of the present invention. Antibodies from avian species may
have particular advantage in detection of the polypeptides of the present invention in
human serum or tissues. Vikinge et al., Biosens. Bioelectron. 13: 1257-1262 (1998).
Following immunization, the antibodies of the present invention can be obtained using any
art-accepted technique. Such techniques are well known in the art and are described in
detail in references such as Coligan, supra; Zola, supra; Howard et al. (eds.), Basic
Methods in Antibody Production and Characterization, CRC Press (2000); Harlow, supra;
Davis (ed.), Monoclonal Antibody Protocols, Vol. 45, Humana Press (1995); Delves (ed.),
Antibody Production: Essential Techniques, John Wiley & Son Ltd (1997); and Kenney,
Antibody Solution: An Antibody Methods Manual, Chapman & Hall (1997).

Briefly, such techniques include, inter alia, production of monoclonal antibodies
by hybridomas and expression of antibodies or fragments or derivatives thereof from host
cells engineered to express immunoglobulin genes or fragments thereof. These two
methods of production are not mutually exclusive: genes encoding antibodies specific for
the polypeptides of the present invention can be cloned from hybridomas and thereafter
expressed in other host cells. Nor need the two necessarily be performed together: e.g.,
genes encoding antibodies specific for the polypeptides of the present invention can be
cloned directly from B cells known to be specific for the desired protein, as further
described in U.S. Patent No. 5,627,052, the disclosure of which is incorporated herein by
reference in its entirety, or from antibody-displaying phage.

Recombinant expression in host cells is particularly useful when fragments or 
derivatives of the antibodies of the present invention are desired. 

Host cells for recombinant production of whole antibodies, antibody
fragments, or antibody derivatives can be prokaryotic or eukaryotic.

Prokaryotic hosts are particularly useful for producing phage displayed antibodies 
of the present invention. 

The technology of phage-displayed antibodies, in which antibody variable region
fragments are fused, for example, to the gene III protein (pIII) or gene VIII protein (pVIII)
for display on the surface of filamentous phage, such as M13, is by now well-established.
See, e.g., Sidhu, Curr. Opin. Biotechnol. 11(6): 610-6 (2000); Griffiths et al., Curr. Opin.
Biotechnol. 9(1): 102-8 (1998); Hoogenboom et al., Immunotechnology 4(1): 1-20 (1998);
Rader et al., Current Opinion in Biotechnology 8: 503-508 (1997); Aujame et al., Human
Antibodies 8: 155-168 (1997); Hoogenboom, Trends in Biotechnol. 15: 62-70 (1997); de
Kruif et al., 17: 453-455 (1996); Barbas et al., Trends in Biotechnol. 14: 230-234 (1996);
Winter et al., Ann. Rev. Immunol. 433-455 (1994). Techniques and protocols required to
generate, propagate, screen (pan), and use the antibody fragments from such libraries have
recently been compiled. See, e.g., Barbas (2001), supra; Kay, supra; and Abelson, supra.

Typically, phage-displayed antibody fragments are scFv fragments or Fab
fragments; when desired, full length antibodies can be produced by cloning the variable
regions from the displaying phage into a complete antibody and expressing the full length
antibody in a further prokaryotic or a eukaryotic host cell. Eukaryotic cells are also useful
for expression of the antibodies, antibody fragments, and antibody derivatives of the
present invention. For example, antibody fragments of the present invention can be
produced in Pichia pastoris and in Saccharomyces cerevisiae. See, e.g., Takahashi et al.,
Biosci. Biotechnol. Biochem. 64(10): 2138-44 (2000); Freyre et al., J. Biotechnol.
76(2-3): 157-63 (2000); Fischer et al., Biotechnol. Appl. Biochem. 30 (Pt 2): 117-20
(1999); Pennell et al., Res. Immunol. 149(6): 599-603 (1998); Eldin et al., J. Immunol.
Methods 201(1): 67-75 (1997); Frenken et al., Res. Immunol. 149(6): 589-99 (1998); and
Shusta et al., Nature Biotechnol. 16(8): 773-7 (1998).

Antibodies, including antibody fragments and derivatives, of the present invention
can also be produced in insect cells. See, e.g., Li et al., Protein Expr. Purif. 21(1): 121-8
(2001); Ailor et al., Biotechnol. Bioeng. 58(2-3): 196-203 (1998); Hsu et al., Biotechnol.
Prog. 13(1): 96-104 (1997); Edelman et al., Immunology 91(1): 13-9 (1997); and Nesbit et
al., J. Immunol. Methods 151(1-2): 201-8 (1992).

Antibodies and fragments and derivatives thereof of the present invention can also
be produced in plant cells, particularly maize or tobacco. See, e.g., Giddings et al., Nature
Biotechnol. 18(11): 1151-5 (2000); Gavilondo et al., Biotechniques 29(1): 128-38 (2000);
Fischer et al., J. Biol. Regul. Homeost. Agents 14(2): 83-92 (2000); Fischer et al.,
Biotechnol. Appl. Biochem. 30 (Pt 2): 113-6 (1999); Fischer et al., Biol. Chem. 380(7-8):
825-39 (1999); Russell, Curr. Top. Microbiol. Immunol. 240: 119-38 (1999); and Ma et
al., Plant Physiol. 109(2): 341-6 (1995).

Antibodies, including antibody fragments and derivatives, of the present invention
can also be produced in transgenic, non-human, mammalian milk. See, e.g., Pollock et al.,
J. Immunol. Methods 231: 147-57 (1999); Young et al., Res. Immunol. 149: 609-10
(1998); and Limonta et al., Immunotechnology 1: 107-13 (1995).

Mammalian cells useful for recombinant expression of antibodies, antibody
fragments, and antibody derivatives of the present invention include CHO cells, COS
cells, 293 cells, and myeloma cells. Verma et al., J. Immunol. Methods 216(1-2): 165-81
(1998) review and compare bacterial, yeast, insect and mammalian expression systems for
expression of antibodies. Antibodies of the present invention can also be prepared by cell
free translation, as further described in Merk et al., J. Biochem. (Tokyo) 125(2): 328-33
(1999) and Ryabova et al., Nature Biotechnol. 15(1): 79-84 (1997), and in the milk of
transgenic animals, as further described in Pollock et al., J. Immunol. Methods 231(1-2):
147-57 (1999).

The invention further provides antibody fragments that bind specifically to one or
more of the polypeptides of the present invention or to one or more of the polypeptides
encoded by the isolated nucleic acid molecules of the present invention, or the binding of
which can be competitively inhibited by one or more of the polypeptides of the present
invention or one or more of the polypeptides encoded by the isolated nucleic acid
molecules of the present invention. Among such useful fragments are Fab, Fab', Fv,
F(ab')2, and single chain Fv (scFv) fragments. Other useful fragments are described in
Hudson, Curr. Opin. Biotechnol. 9(4): 395-402 (1998).

The present invention also relates to antibody derivatives that bind specifically to
one or more of the polypeptides of the present invention, to one or more of the
polypeptides encoded by the isolated nucleic acid molecules of the present invention, or
the binding of which can be competitively inhibited by one or more of the polypeptides of
the present invention or one or more of the polypeptides encoded by the isolated nucleic
acid molecules of the present invention.

Among such useful derivatives are chimeric, primatized, and humanized
antibodies; such derivatives are less immunogenic in human beings, and thus are more
suitable for in vivo administration, than are unmodified antibodies from non-human
mammalian species. Another useful method is PEGylation to increase the serum half life
of the antibodies.

Chimeric antibodies typically include heavy and/or light chain variable regions
(including both CDR and framework residues) of immunoglobulins of one species,
typically mouse, fused to constant regions of another species, typically human. See, e.g.,
Morrison et al., Proc. Natl. Acad. Sci. USA 81(21): 6851-5 (1984); Sharon et al., Nature
309(5966): 364-7 (1984); Takeda et al., Nature 314(6010): 452-4 (1985); and U.S. Patent
No. 5,807,715, the disclosure of which is incorporated herein by reference in its entirety.
Primatized and humanized antibodies typically include heavy and/or light chain CDRs
from a murine antibody grafted into a non-human primate or human antibody V region
framework, usually further comprising a human constant region. See, e.g., Riechmann et al.,
Nature 332(6162): 323-7 (1988); Co et al., Nature 351(6326): 501-2 (1991); and U.S. Patent Nos.
6,054,297; 5,821,337; 5,770,196; 5,766,886; 5,821,123; 5,869,619; 6,180,377; 6,013,256;
5,693,761; and 6,180,370, the disclosures of which are incorporated herein by reference in
their entireties. Other useful antibody derivatives of the invention include heteromeric
antibody complexes and antibody fusions, such as diabodies (bispecific antibodies),
single-chain diabodies, and intrabodies.

It is contemplated that the nucleic acids encoding the antibodies of the present
invention can be operably joined to other nucleic acids forming a recombinant vector for
cloning or for expression of the antibodies of the invention. Accordingly, the present
invention includes any recombinant vector containing the coding sequences, or part
thereof, whether for eukaryotic transduction, transfection or gene therapy. Such vectors
may be prepared using conventional molecular biology techniques, known to those with
skill in the art, and would comprise DNA encoding sequences for the immunoglobulin
V-regions including framework and CDRs or parts thereof, and a suitable promoter either
with or without a signal sequence for intracellular transport. Such vectors may be
transduced or transfected into eukaryotic cells or used for gene therapy (Marasco et al.,
Proc. Natl. Acad. Sci. (USA) 90: 7889-7893 (1993); Duan et al., Proc. Natl. Acad. Sci.
(USA) 91: 5075-5079 (1994)), by conventional techniques, known to those with skill in
the art.

The antibodies of the present invention, including fragments and derivatives
thereof, can usefully be labeled. It is, therefore, another aspect of the present invention to
provide labeled antibodies that bind specifically to one or more of the polypeptides of the
present invention, to one or more of the polypeptides encoded by the isolated nucleic acid
molecules of the present invention, or the binding of which can be competitively inhibited
by one or more of the polypeptides of the present invention or one or more of the
polypeptides encoded by the isolated nucleic acid molecules of the present invention. The
choice of label depends, in part, upon the desired use.

For example, when the antibodies of the present invention are used for
immunohistochemical staining of tissue samples, the label can usefully be an enzyme that
catalyzes production and local deposition of a detectable product. Enzymes typically
conjugated to antibodies to permit their immunohistochemical visualization are well
known, and include alkaline phosphatase, β-galactosidase, glucose oxidase, horseradish
peroxidase (HRP), and urease. Typical substrates for production and deposition of
visually detectable products include o-nitrophenyl-beta-D-galactopyranoside (ONPG);
o-phenylenediamine dihydrochloride (OPD); p-nitrophenyl phosphate (PNPP);
p-nitrophenyl-beta-D-galactopyranoside (PNPG); 3,3'-diaminobenzidine (DAB);
3-amino-9-ethylcarbazole (AEC); 4-chloro-1-naphthol (CN);
5-bromo-4-chloro-3-indolyl-phosphate (BCIP); ABTS®; BluoGal; iodonitrotetrazolium
(INT); nitroblue tetrazolium chloride (NBT); phenazine methosulfate (PMS);
phenolphthalein monophosphate (PMP); tetramethyl benzidine (TMB); tetranitroblue
tetrazolium (TNBT); X-Gal; X-Gluc; and X-Glucoside.

Other substrates can be used to produce products for local deposition that are
luminescent. For example, in the presence of hydrogen peroxide (H2O2), horseradish
peroxidase (HRP) can catalyze the oxidation of cyclic diacylhydrazides, such as luminol.
Immediately following the oxidation, the luminol is in an excited state (intermediate
reaction product), which decays to the ground state by emitting light. Strong enhancement
of the light emission is produced by enhancers, such as phenolic compounds. Advantages
include high sensitivity, high resolution, and rapid detection without radioactivity and
requiring only small amounts of antibody. See, e.g., Thorpe et al., Methods Enzymol. 133:
331-53 (1986); Kricka et al., J. Immunoassay 17(1): 67-83 (1996); and Lundqvist et al., J.
Biolumin. Chemilumin. 10(6): 353-9 (1995). Kits for such enhanced chemiluminescent
detection (ECL) are available commercially. The antibodies can also be labeled using
colloidal gold.

As another example, when the antibodies of the present invention are used, e.g., for
flow cytometric detection, for scanning laser cytometric detection, or for fluorescent
immunoassay, they can usefully be labeled with fluorophores. There are a wide variety of
fluorophore labels that can usefully be attached to the antibodies of the present invention.
For flow cytometric applications, both for extracellular detection and for intracellular
detection, common useful fluorophores can be fluorescein isothiocyanate (FITC),
allophycocyanin (APC), R-phycoerythrin (PE), peridinin chlorophyll protein (PerCP),
Texas Red, Cy3, Cy5, and fluorescence resonance energy transfer tandem fluorophores
such as PerCP-Cy5.5, PE-Cy5, PE-Cy5.5, PE-Cy7, PE-Texas Red, and APC-Cy7.

Other fluorophores include, inter alia, Alexa Fluor® 350, Alexa Fluor® 488,
Alexa Fluor® 532, Alexa Fluor® 546, Alexa Fluor® 568, Alexa Fluor® 594, Alexa
Fluor® 647 (monoclonal antibody labeling kits available from Molecular Probes, Inc.,
Eugene, OR, USA), BODIPY dyes, such as BODIPY 493/503, BODIPY FL, BODIPY
R6G, BODIPY 530/550, BODIPY TMR, BODIPY 558/568, BODIPY
564/570, BODIPY 576/589, BODIPY 581/591, BODIPY TR, BODIPY 630/650,
BODIPY 650/665, Cascade Blue, Cascade Yellow, Dansyl, lissamine rhodamine B,
Marina Blue, Oregon Green 488, Oregon Green 514, Pacific Blue, rhodamine 6G,
rhodamine green, rhodamine red, tetramethylrhodamine, Texas Red (available from
Molecular Probes, Inc., Eugene, OR, USA), and Cy2, Cy3, Cy3.5, Cy5, Cy5.5, Cy7, all of
which are also useful for fluorescently labeling the antibodies of the present invention.
For secondary detection using labeled avidin, streptavidin, captavidin or neutravidin, the
antibodies of the present invention can usefully be labeled with biotin.

When the antibodies of the present invention are used, e.g., for western blotting
applications, they can usefully be labeled with radioisotopes, such as 33P, 32P, 35S, 3H, and
125I. As another example, when the antibodies of the present invention are used for
radioimmunotherapy, the label can usefully be 228Th, 227Ac, 225Ac, 223Ra, 213Bi, 212Pb,
212Bi, 211At, 203Pb, 194Os, 188Re, 186Re, 153Sm, 149Tb, 131I, 125I, 111In, 105Rh, 99mTc, 97Ru, 90Y,
90Sr, 88Y, 72Se, 67Cu, or 47Sc.

As another example, when the antibodies of the present invention are to be used
for in vivo diagnostic use, they can be rendered detectable by conjugation to MRI contrast
agents, such as gadolinium diethylenetriaminepentaacetic acid (DTPA), Lauffer et al.,
Radiology 207(2): 529-38 (1998), or by radioisotopic labeling.

As would be understood, use of the labels described above is not restricted to the
application for which they were mentioned.

The antibodies of the present invention, including fragments and derivatives
thereof, can also be conjugated to toxins, in order to target the toxin's ablative action to
cells that display and/or express the polypeptides of the present invention. Commonly, the
antibody in such immunotoxins is conjugated to Pseudomonas exotoxin A, diphtheria
toxin, shiga toxin A, anthrax toxin lethal factor, or ricin. See Hall (ed.), Immunotoxin
Methods and Protocols (Methods in Molecular Biology, vol. 166), Humana Press (2000);
and Frankel et al. (eds.), Clinical Applications of Immunotoxins, Springer-Verlag (1998).

The antibodies of the present invention can usefully be attached to a substrate, and
it is, therefore, another aspect of the invention to provide antibodies that bind specifically
to one or more of the polypeptides of the present invention, to one or more of the
polypeptides encoded by the isolated nucleic acid molecules of the present invention, or
the binding of which can be competitively inhibited by one or more of the polypeptides of
the present invention or one or more of the polypeptides encoded by the isolated nucleic
acid molecules of the present invention, attached to a substrate. Substrates can be porous
or nonporous, planar or nonplanar. For example, the antibodies of the present invention
can usefully be conjugated to filtration media, such as NHS-activated Sepharose or
CNBr-activated Sepharose, for purposes of immunoaffinity chromatography. As a further
example, the antibodies of the present invention can usefully be attached to paramagnetic
microspheres, typically by biotin-streptavidin interaction, which microspheres can then be
used for isolation of cells that express or display the polypeptides of the present invention. As
another example, the antibodies of the present invention can usefully be attached to the
surface of a microtiter plate for ELISA.

As noted above, the antibodies of the present invention can be produced in
prokaryotic and eukaryotic cells. It is, therefore, another aspect of the present invention to
provide cells that express the antibodies of the present invention, including hybridoma
cells, B cells, plasma cells, and host cells recombinantly modified to express the
antibodies of the present invention.

In yet a further aspect, the present invention provides aptamers evolved to bind
specifically to one or more of the CSPs of the present invention or to polypeptides
encoded by the CSNAs of the invention.

In sum, one of skill in the art, provided with the teachings of this invention, has
available a variety of methods which may be used to alter the biological properties of the
antibodies of this invention, including methods which would increase or decrease the
stability or half-life, immunogenicity, toxicity, affinity or yield of a given antibody
molecule, or to alter it in any other way that may render it more suitable for a particular
application.

Transgenic Animals and Cells 

In another aspect, the invention provides transgenic cells and non-human
organisms comprising nucleic acid molecules of the invention. In a preferred
embodiment, the transgenic cells and non-human organisms comprise a nucleic acid
molecule encoding a CSP. In a preferred embodiment, the CSP comprises an amino acid
sequence selected from SEQ ID NO: 95-248, or a fragment, mutein, homologous protein
or allelic variant thereof. In another preferred embodiment, the transgenic cells and
non-human organisms comprise a CSNA of the invention, preferably a CSNA comprising a
nucleotide sequence selected from the group consisting of SEQ ID NO: 1-94, or a part,
substantially similar nucleic acid molecule, allelic variant or hybridizing nucleic acid
molecule thereof.

In another embodiment, the transgenic cells and non-human organisms have a
targeted disruption or replacement of the endogenous orthologue of the human CSG. The
transgenic cells can be embryonic stem cells or somatic cells. The transgenic non-human
organisms can be chimeric, nonchimeric heterozygotes, and nonchimeric homozygotes.
Methods of producing transgenic animals are well known in the art. See, e.g., Hogan et
al., Manipulating the Mouse Embryo: A Laboratory Manual, 2d ed., Cold Spring Harbor
Press (1999); Jackson et al., Mouse Genetics and Transgenics: A Practical Approach,
Oxford University Press (2000); and Pinkert, Transgenic Animal Technology: A
Laboratory Handbook, Academic Press (1999).

Any technique known in the art may be used to introduce a nucleic acid molecule
of the invention into an animal to produce the founder lines of transgenic animals. Such
techniques include, but are not limited to, pronuclear microinjection (see, e.g., Paterson
et al., Appl. Microbiol. Biotechnol. 40: 691-698 (1994); Carver et al., Biotechnology 11:
1263-1270 (1993); Wright et al., Biotechnology 9: 830-834 (1991); and U.S. Patent No.
4,873,191, herein incorporated by reference in its entirety); retrovirus-mediated gene
transfer into germ lines, blastocysts or embryos (see, e.g., Van der Putten et al., Proc.
Natl. Acad. Sci. USA 82: 6148-6152 (1985)); gene targeting in embryonic stem cells (see,
e.g., Thompson et al., Cell 56: 313-321 (1989)); electroporation of cells or embryos (see,
e.g., Lo, Mol. Cell. Biol. 3: 1803-1814 (1983)); introduction using a gene gun (see,
e.g., Ulmer et al., Science 259: 1745-49 (1993)); introducing nucleic acid constructs into
embryonic pluripotent stem cells and transferring the stem cells back into the blastocyst;
and sperm-mediated gene transfer (see, e.g., Lavitrano et al., Cell 57: 717-723 (1989)).

Other techniques include, for example, nuclear transfer into enucleated oocytes of
nuclei from cultured embryonic, fetal, or adult cells induced to quiescence (see, e.g.,
Campbell et al., Nature 380: 64-66 (1996); Wilmut et al., Nature 385: 810-813 (1997)).
The present invention provides for transgenic animals that carry the transgene (i.e., a
nucleic acid molecule of the invention) in all their cells, as well as animals which carry the
transgene in some, but not all their cells, i.e., mosaic animals or chimeric animals.

The transgene may be integrated as a single transgene or as multiple copies, such
as in concatamers, e.g., head-to-head tandems or head-to-tail tandems. The transgene
may also be selectively introduced into and activated in a particular cell type by following,
e.g., the teaching of Lasko et al., Proc. Natl. Acad. Sci. USA 89: 6232-6236 (1992).
The regulatory sequences required for such a cell-type specific activation will depend
upon the particular cell type of interest, and will be apparent to those of skill in the art.

Once transgenic animals have been generated, the expression of the recombinant
gene may be assayed utilizing standard techniques. Initial screening may be accomplished
by Southern blot analysis or PCR techniques to analyze animal tissues to verify that
integration of the transgene has taken place. The level of mRNA expression of the
transgene in the tissues of the transgenic animals may also be assessed using techniques
which include, but are not limited to, Northern blot analysis of tissue samples obtained
from the animal, in situ hybridization analysis, and reverse transcriptase-PCR (RT-PCR).
Samples of transgenic gene-expressing tissue may also be evaluated
immunocytochemically or immunohistochemically using antibodies specific for the
transgene product.

Once the founder animals are produced, they may be bred, inbred, outbred, or
crossbred to produce colonies of the particular animal. Examples of such breeding
strategies include, but are not limited to: outbreeding of founder animals with more than
one integration site in order to establish separate lines; inbreeding of separate lines in
order to produce compound transgenics that express the transgene at higher levels because
of the effects of additive expression of each transgene; crossing of heterozygous
transgenic animals to produce animals homozygous for a given integration site in order to
both augment expression and eliminate the need for screening of animals by DNA
analysis; crossing of separate homozygous lines to produce compound heterozygous or
homozygous lines; and breeding to place the transgene on a distinct background that is
appropriate for an experimental model of interest.

Transgenic animals of the invention have uses which include, but are not limited
to, animal model systems useful in elaborating the biological function of polypeptides of
the present invention, studying conditions and/or disorders associated with aberrant
expression, and in screening for compounds effective in ameliorating such conditions
and/or disorders.

Methods for creating a transgenic animal with a disruption of a targeted gene are
also well known in the art. In general, a vector is designed to comprise some nucleotide
sequences homologous to the endogenous targeted gene. The vector is introduced into a
cell so that it may integrate, via homologous recombination with chromosomal sequences,
into the endogenous gene, thereby disrupting the function of the endogenous gene. The
transgene may also be selectively introduced into a particular cell type, thus inactivating
the endogenous gene in only that cell type. See, e.g., Gu et al., Science 265: 103-106
(1994). The regulatory sequences required for such a cell-type specific inactivation will
depend upon the particular cell type of interest, and will be apparent to those of skill in the
art. See, e.g., Smithies et al., Nature 317: 230-234 (1985); Thomas et al., Cell 51:
503-512 (1987); Thompson et al., Cell 56: 313-321 (1989).

In one embodiment, a mutant, non-functional nucleic acid molecule of the
invention (or a completely unrelated DNA sequence) flanked by DNA homologous to the
endogenous nucleic acid sequence (either the coding regions or regulatory regions of the
gene) can be used, with or without a selectable marker and/or a negative selectable
marker, to transfect cells that express polypeptides of the invention in vivo. In another
embodiment, techniques known in the art are used to generate knockouts in cells that
contain, but do not express, the gene of interest. Insertion of the DNA construct, via
targeted homologous recombination, results in inactivation of the targeted gene. Such
approaches are particularly suited to research and agricultural fields where modifications
to embryonic stem cells can be used to generate animal offspring with an inactive targeted
gene. See, e.g., Thomas, supra and Thompson, supra. However, this approach can be
routinely adapted for use in humans provided the recombinant DNA constructs are directly
administered or targeted to the required site in vivo using appropriate viral vectors that
will be apparent to those of skill in the art.

In further embodiments of the invention, cells that are genetically engineered to
express the polypeptides of the invention, or alternatively, that are genetically engineered
not to express the polypeptides of the invention (e.g., knockouts) are administered to a
patient in vivo. Such cells may be obtained from an animal or patient or an MHC
compatible donor and can include, but are not limited to, fibroblasts, bone marrow cells,
blood cells (e.g., lymphocytes), adipocytes, muscle cells, endothelial cells, etc. The cells
are genetically engineered in vitro using recombinant DNA techniques to introduce the
coding sequence of polypeptides of the invention into the cells, or alternatively, to disrupt
the coding sequence and/or endogenous regulatory sequence associated with the
polypeptides of the invention, e.g., by transduction (using viral vectors, and preferably
vectors that integrate the transgene into the cell genome) or transfection procedures,
including, but not limited to, the use of plasmids, cosmids, YACs, naked DNA,
electroporation, liposomes, etc.

The coding sequence of the polypeptides of the invention can be placed under the
control of a strong constitutive or inducible promoter or promoter/enhancer to achieve
expression, and preferably secretion, of the polypeptides of the invention. The engineered
cells which express and preferably secrete the polypeptides of the invention can be
introduced into the patient systemically, e.g., in the circulation, or intraperitoneally.

Alternatively, the cells can be incorporated into a matrix and implanted in the
body, e.g., genetically engineered fibroblasts can be implanted as part of a skin graft;
genetically engineered endothelial cells can be implanted as part of a lymphatic or
vascular graft. See, e.g., U.S. Patent Nos. 5,399,349 and 5,460,959, each of which is
incorporated by reference herein in its entirety.

When the cells to be administered are non-autologous or non-MHC compatible
cells, they can be administered using well known techniques which prevent the
development of a host immune response against the introduced cells. For example, the
cells may be introduced in an encapsulated form which, while allowing for an exchange of
components with the immediate extracellular environment, does not allow the introduced
cells to be recognized by the host immune system.

Transgenic and "knock-out" animals of the invention have uses which include, but
are not limited to, animal model systems useful in elaborating the biological function of
polypeptides of the present invention, studying conditions and/or disorders associated with
aberrant expression, and in screening for compounds effective in ameliorating such
conditions and/or disorders.

Computer Readable Means 

A further aspect of the invention is a computer readable means for storing the
nucleic acid and amino acid sequences of the instant invention. In a preferred
embodiment, the invention provides a computer readable means for storing SEQ ID NO:
95-248 and SEQ ID NO: 1-94 as described herein, as the complete set of sequences or in
any combination. The records of the computer readable means can be accessed for
reading and display and for interface with a computer system for the application of
programs allowing for the location of data upon a query for data meeting certain criteria,
the comparison of sequences, the alignment or ordering of sequences meeting a set of
criteria, and the like.

The nucleic acid and amino acid sequences of the invention are particularly useful
as components in databases useful for search analyses as well as in sequence analysis
algorithms. As used herein, the terms "nucleic acid sequences of the invention" and
"amino acid sequences of the invention" mean any detectable chemical or physical
characteristic of a polynucleotide or polypeptide of the invention that is or may be reduced
to or stored in a computer readable form. These include, without limitation,
chromatographic scan data or peak data, photographic data or scan data therefrom, and
mass spectrographic data.

This invention provides computer readable media having stored thereon sequences
of the invention. A computer readable medium may comprise one or more of the
following: a nucleic acid sequence comprising a sequence of a nucleic acid sequence of
the invention; an amino acid sequence comprising an amino acid sequence of the
invention; a set of nucleic acid sequences wherein at least one of said sequences comprises
the sequence of a nucleic acid sequence of the invention; a set of amino acid sequences
wherein at least one of said sequences comprises the sequence of an amino acid sequence
of the invention; a data set representing a nucleic acid sequence comprising the sequence
of one or more nucleic acid sequences of the invention; a data set representing a nucleic
acid sequence encoding an amino acid sequence comprising the sequence of an amino acid
sequence of the invention; a set of nucleic acid sequences wherein at least one of said
sequences comprises the sequence of a nucleic acid sequence of the invention; a set of
amino acid sequences wherein at least one of said sequences comprises the sequence of an
amino acid sequence of the invention; a data set representing a nucleic acid sequence
comprising the sequence of a nucleic acid sequence of the invention; a data set
representing a nucleic acid sequence encoding an amino acid sequence comprising the
sequence of an amino acid sequence of the invention. The computer readable medium can
be any composition of matter used to store information or data, including, for example,
commercially available floppy disks, tapes, hard drives, compact disks, and video disks.

Also provided by the invention are methods for the analysis of character
sequences, particularly genetic sequences. Preferred methods of sequence analysis
include, for example, methods of sequence homology analysis, such as identity and
similarity analysis, RNA structure analysis, sequence assembly, cladistic analysis,
sequence motif analysis, open reading frame determination, nucleic acid base calling, and
sequencing chromatogram peak analysis.
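
By way of illustration only, open reading frame determination, one of the analyses named above, can be sketched in Python as follows. The function names (translate, longest_orf) and the example sequence are hypothetical and are not taken from the sequences of the invention; the sketch assumes the standard genetic code, scans only the three forward reading frames, and reports only frames that end in a stop codon.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon; unknown codons become 'X'."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def longest_orf(dna):
    """Return the longest Met-initiated, stop-terminated peptide found in
    the three forward reading frames (hypothetical helper; the reverse
    strand is deliberately not scanned in this sketch)."""
    best = ""
    for frame in range(3):
        protein = translate(dna[frame:])
        # Drop the last piece: it is not followed by a stop codon.
        for segment in protein.split("*")[:-1]:
            start = segment.find("M")
            if start != -1 and len(segment) - start > len(best):
                best = segment[start:]
    return best


if __name__ == "__main__":
    print(longest_orf("CATGGCCTAA"))
```

A fuller implementation would also scan the reverse complement and apply a minimum length cutoff; both are omitted here for brevity.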

A computer-based method is provided for performing nucleic acid sequence
identity or similarity identification. This method comprises the steps of providing a
nucleic acid sequence comprising the sequence of a nucleic acid of the invention in a
computer readable medium; and comparing said nucleic acid sequence to at least one
nucleic acid or amino acid sequence to identify sequence identity or similarity.

A computer-based method is also provided for performing amino acid homology
identification, said method comprising the steps of: providing an amino acid sequence
comprising the sequence of an amino acid of the invention in a computer readable
medium; and comparing said amino acid sequence to at least one nucleic acid or an amino
acid sequence to identify homology.
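
As a non-limiting sketch of the comparing step described above, the following Python example aligns two sequences globally and reports percent identity over the alignment. The choice of a Needleman-Wunsch alignment, the scoring values (match, mismatch, gap) and the function names are assumptions made for illustration; the specification does not prescribe a particular algorithm or scoring scheme.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment with a linear gap penalty.
    Scoring values are illustrative assumptions."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,
                score[i - 1][j] + gap,
                score[i][j - 1] + gap,
            )
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Identical aligned positions as a percentage of alignment length."""
    x, y = global_align(a.upper(), b.upper())
    if not x:
        return 0.0
    matches = sum(1 for p, q in zip(x, y) if p == q and p != "-")
    return 100.0 * matches / len(x)


if __name__ == "__main__":
    print(percent_identity("ACGT", "ACGA"))
```

The same routine works on amino acid strings; a similarity (as opposed to identity) measure would replace the match/mismatch values with a substitution matrix.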

A computer-based method is still further provided for assembly of overlapping
nucleic acid sequences into a single nucleic acid sequence, said method comprising the
steps of: providing a first nucleic acid sequence comprising the sequence of a nucleic acid
of the invention in a computer readable medium; and screening for at least one
overlapping region between said first nucleic acid sequence and a second nucleic acid
sequence. In addition, the invention includes a method of using patterns of expression
associated with either the nucleic acids or proteins in a computer-based method to
diagnose disease.
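
The screening step of the assembly method described above can be illustrated, purely as a sketch, by searching for the longest exact overlap between the end of a first sequence and the start of a second, and merging the two when one is found. The function names, the exact-match requirement and the min_len parameter are assumptions for illustration; practical assemblers tolerate mismatches and consider both strands.

```python
def longest_overlap(first, second, min_len=3):
    """Length of the longest suffix of `first` that equals a prefix of
    `second`, or 0 if no overlap of at least `min_len` bases exists."""
    max_k = min(len(first), len(second))
    for k in range(max_k, min_len - 1, -1):
        if first.endswith(second[:k]):
            return k
    return 0


def merge_overlapping(first, second, min_len=3):
    """Join two sequences into one across their overlap; returns None
    when no qualifying overlap is found."""
    k = longest_overlap(first, second, min_len)
    if k == 0:
        return None
    return first + second[k:]


if __name__ == "__main__":
    print(merge_overlapping("ACGTTGCA", "TGCAGGT"))
```

The min_len cutoff guards against joining sequences on a short chance match; its value here is arbitrary.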

Diagnostic Methods for Colon Cancer 

The present invention also relates to quantitative and qualitative diagnostic assays
and methods for detecting, diagnosing, monitoring, staging and predicting cancers by
comparing expression of a CSNA or a CSP in a human patient that has or may have colon
cancer, or who is at risk of developing colon cancer, with the expression of a CSNA or a
CSP in a normal human control. For purposes of the present invention, "expression of a
CSNA" or "CSNA expression" means the quantity of CSNA mRNA that can be measured
by any method known in the art or the level of transcription that can be measured by any
method known in the art in a cell, tissue, organ or whole patient. Similarly, the term
"expression of a CSP" or "CSP expression" means the amount of CSP that can be
measured by any method known in the art or the level of translation of a CSNA that can be
measured by any method known in the art.

The present invention provides methods for diagnosing colon cancer in a patient,
in particular adenocarcinoma, by analyzing for changes in levels of CSNA or CSP in cells,
tissues, organs or bodily fluids compared with levels of CSNA or CSP in cells, tissues,
organs or bodily fluids of preferably the same type from a normal human control, wherein
an increase, or decrease in certain cases, in levels of a CSNA or CSP in the patient versus
the normal human control is associated with the presence of colon cancer or with a
predilection to the disease. In another preferred embodiment, the present invention
provides methods for diagnosing colon cancer in a patient by analyzing changes in the
structure of the mRNA of a CSG compared to the mRNA from a normal control. These
changes include, without limitation, aberrant splicing, alterations in polyadenylation
and/or alterations in 5' nucleotide capping. In yet another preferred embodiment, the
present invention provides methods for diagnosing colon cancer in a patient by analyzing
changes in a CSP compared to a CSP from a normal patient. These changes include, e.g.,
alterations, including post translational modifications such as glycosylation and/or
phosphorylation of the CSP or changes in the subcellular CSP localization.

For purposes of the present invention, diagnosing means that CSNA or CSP levels
are used to determine the presence or absence of disease in a patient. As will be
understood by those of skill in the art, measurement of other diagnostic parameters may be
required for definitive diagnosis or determination of the appropriate treatment for the
disease. The determination may be made by a clinician, a doctor, a testing laboratory, or a
patient using an over-the-counter test. The patient may have symptoms of disease or may
be asymptomatic. In addition, the CSNA or CSP levels of the present invention may be
used as a screening marker to determine whether further tests or biopsies are warranted. In
addition, the CSNA or CSP levels may be used to determine the vulnerability or
susceptibility to disease.

In a preferred embodiment, the expression of a CSNA is measured by determining
the amount of an mRNA that encodes an amino acid sequence selected from SEQ ID NO:
95-248, a homolog, an allelic variant, or a fragment thereof. In a more preferred 
embodiment, the CSNA expression that is measured is the level of expression of a CSNA 
mRNA selected from SEQ ID NO: 1-94, or a hybridizing nucleic acid, homologous 
nucleic acid or allelic variant thereof, or a part of any of these nucleic acid molecules. 

CSNA expression may be measured by any method known in the art, such as those
described supra, including measuring mRNA expression by Northern blot, quantitative or
qualitative reverse transcriptase PCR (RT-PCR), microarray, dot or slot blots or in situ
hybridization. See, e.g., Ausubel (1992), supra; Ausubel (1999), supra; Sambrook
(1989), supra; and Sambrook (2001), supra. CSNA transcription may be measured by any
method known in the art, including using a reporter gene operably linked to the promoter
of a CSG of interest or performing nuclear run-off assays. Alterations in mRNA structure,
e.g., aberrant splicing variants, may be determined by any method known in the art,
including RT-PCR followed by sequencing or restriction analysis. As necessary, CSNA
expression may be compared to a known control, such as normal colon nucleic acid, to
detect a change in expression.

In another preferred embodiment, the expression of a CSP is measured by
determining the level of a CSP having an amino acid sequence selected from the group
consisting of SEQ ID NO: 95-248, a homolog, an allelic variant, or a fragment thereof.
Such levels are preferably determined in at least one of cells, tissues, organs and/or bodily
fluids, including determination of normal and abnormal levels. Thus, for instance, a
diagnostic assay in accordance with the invention for diagnosing over- or underexpression
of a CSNA or CSP compared to normal control bodily fluids, cells, or tissue samples may
be used to diagnose the presence of colon cancer. The expression level of a CSP may be
determined by any method known in the art, such as those described supra. In a preferred
embodiment, the CSP expression level may be determined by radioimmunoassays,
competitive-binding assays, ELISA, Western blot, FACS, immunohistochemistry,
immunoprecipitation, and proteomic approaches, including two-dimensional gel
electrophoresis (2D electrophoresis) and non-gel-based approaches such as mass
spectrometry or protein interaction profiling. See, e.g., Harlow (1999), supra; Ausubel
(1992), supra; and Ausubel (1999), supra. Alterations in the CSP structure may be
determined by any method known in the art, including, e.g., using antibodies that
specifically recognize phosphoserine, phosphothreonine or phosphotyrosine residues,
two-dimensional polyacrylamide gel electrophoresis (2D PAGE) and/or chemical analysis
of amino acid residues of the protein. Id.

In a preferred embodiment, a radioimmunoassay (RIA) or an ELISA is used. An
antibody specific to a CSP is prepared if one is not already available. In a preferred
embodiment, the antibody is a monoclonal antibody. The anti-CSP antibody is bound to a
solid support and any free protein binding sites on the solid support are blocked with a
protein such as bovine serum albumin. A sample of interest is incubated with the antibody
on the solid support under conditions in which the CSP will bind to the anti-CSP antibody.
The sample is removed, the solid support is washed to remove unbound material, and an
anti-CSP antibody that is linked to a detectable reagent (a radioactive substance for RIA
and an enzyme for ELISA) is added to the solid support and incubated under conditions in
which binding of the CSP to the labeled antibody will occur. After binding, the unbound
labeled antibody is removed by washing. For an ELISA, one or more substrates are added
to produce a colored reaction product that is based upon the amount of a CSP in the
sample. For an RIA, the solid support is counted for radioactive decay signals by any
method known in the art. Quantitative results for both RIA and ELISA typically are
obtained by reference to a standard curve.
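
As a non-limiting sketch, reading a sample concentration from a standard curve may be carried out by interpolation between the nearest standards. The sketch assumes a signal that rises monotonically with concentration and uses simple piecewise-linear interpolation; the function name and the standard values are hypothetical, and other curve models (e.g., four-parameter logistic fits) are commonly used in practice.

```python
def concentration_from_signal(signal, standards):
    """Interpolate a concentration from (concentration, signal) standard points."""
    # Order the standards by signal so neighbouring pairs bracket the sample.
    points = sorted(standards, key=lambda p: p[1])
    if not points[0][1] <= signal <= points[-1][1]:
        raise ValueError("signal lies outside the range of the standard curve")
    for (c_lo, s_lo), (c_hi, s_hi) in zip(points, points[1:]):
        if s_lo <= signal <= s_hi:
            if s_hi == s_lo:
                return c_lo
            fraction = (signal - s_lo) / (s_hi - s_lo)
            return c_lo + fraction * (c_hi - c_lo)
    raise ValueError("at least two standards are required")


# Hypothetical standards: (CSP concentration in ng/ml, absorbance reading).
standards = [(0.0, 0.05), (10.0, 0.25), (50.0, 0.85), (100.0, 1.45)]
estimate = concentration_from_signal(0.55, standards)
```

Signals outside the range of the standards are rejected rather than extrapolated, since the curve is not characterized there.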

Other methods to measure CSP levels are known in the art. For instance, a
competition assay may be employed wherein an anti-CSP antibody is attached to a solid
support and an allocated amount of a labeled CSP and a sample of interest are incubated
with the solid support. The amount of labeled CSP attached to the solid support can be
correlated to the quantity of a CSP in the sample.

Of the proteomic approaches, 2D PAGE is a well known technique. Isolation of
individual proteins from a sample such as serum is accomplished using sequential
separation of proteins by isoelectric point and molecular weight. Typically, polypeptides
are first separated by isoelectric point (the first dimension) and then separated by size
using an electric current (the second dimension). In general, the second dimension is
perpendicular to the first dimension. Because no two proteins with different sequences are
identical on the basis of both size and charge, the result of 2D PAGE is a roughly square
gel in which each protein occupies a unique spot. Analysis of the spots with chemical or
antibody probes, or subsequent protein microsequencing, can reveal the relative abundance
of a given protein and the identity of the proteins in the sample.

Expression levels of a CSNA can be determined by any method known in the art.
PCR and other nucleic acid methods, such as ligase chain reaction (LCR) and
nucleic acid sequence based amplification (NASBA), can be used to detect malignant cells
for diagnosis and monitoring of various malignancies. For example, reverse-transcriptase
PCR (RT-PCR) is a powerful technique which can be used to detect the presence of a
specific mRNA population in a complex mixture of thousands of other mRNA species. In
RT-PCR, an mRNA species is first reverse transcribed to complementary DNA (cDNA)
with use of the enzyme reverse transcriptase; the cDNA is then amplified as in a standard
PCR reaction.

Hybridization to specific DNA molecules (e.g., oligonucleotides) arrayed on a
solid support can be used to both detect the expression of and quantitate the level of
expression of one or more CSNAs of interest. In this approach, all or a portion of one or
more CSNAs is fixed to a substrate. A sample of interest, which may comprise RNA, e.g.,
total RNA or polyA-selected mRNA, or a complementary DNA (cDNA) copy of the RNA,
is incubated with the solid support under conditions in which hybridization will occur
between the DNA on the solid support and the nucleic acid molecules in the sample of
interest. Hybridization between the substrate-bound DNA and the nucleic acid molecules
in the sample can be detected and quantitated by several means, including, without
limitation, radioactive labeling or fluorescent labeling of the nucleic acid molecule or a
secondary molecule designed to detect the hybrid.

The above tests can be carried out on samples derived from a variety of cells, 
bodily fluids and/or tissue extracts such as homogenates or solubilized tissue obtained 
from a patient. Tissue extracts are obtained routinely from tissue biopsy and autopsy 
material. Bodily fluids useful in the present invention include blood, urine, saliva or any
other bodily secretion or derivative thereof. As used herein "blood" includes whole blood,
plasma, serum, circulating epithelial cells, constituents, or any derivative of blood. 

In addition to detection in bodily fluids, the proteins and nucleic acids of the
invention are suitable for detection by cell capture technology. Whole cells may be
captured by a variety of methods, for example magnetic separation, such as described in
U.S. Patent Nos. 5,200,084; 5,186,827; 5,108,933; and 4,925,788, the disclosures of which
are incorporated herein by reference in their entireties. Epithelial cells may be captured
using such products as Dynabeads® or CELLection™ (Dynal Biotech, Oslo, Norway).
Alternatively, fractions of blood may be captured, e.g., the buffy coat fraction (50mm cells
isolated from 5ml of blood) containing epithelial cells. In addition, cancer cells may be
captured using the techniques described in WO 00/47998, the disclosure of which is
incorporated herein by reference in its entirety. Once the cells are captured or
concentrated, the proteins or nucleic acids are detected by the means described in the
subject application. Alternatively, nucleic acids may be captured directly from blood
samples, see U.S. Patent Nos. 6,156,504; 5,501,963; or WO 01/42504, the disclosures of
which are incorporated herein by reference in their entireties.

In a preferred embodiment, the specimen tested for expression of CSNA or CSP
includes without limitation colon tissue, fecal samples, colonocytes, colon cells grown in
cell culture, blood, serum, lymph node tissue, and lymphatic fluid. In another preferred
embodiment, especially when metastasis of a primary colon cancer is known or suspected,
specimens include, without limitation, tissues from brain, bone, bone marrow, liver, lungs,
and adrenal glands. In general, the tissues may be sampled by biopsy, including, without
limitation, needle biopsy, e.g., transthoracic needle aspiration, cervical mediastinoscopy,
endoscopic lymph node biopsy, video-assisted thoracoscopy, exploratory thoracotomy,
bone marrow biopsy and bone marrow aspiration.

Colonocytes represent an important source of the CSP or CSNA because they
provide a picture of the immediate past metabolic history of the GI tract of a subject. In
addition, such cells are representative of the cell population from a statistically large
sampling frame reflecting the state of the colonic mucosa along the entire length of the
colon in a non-invasive manner, in contrast to a limited sampling by colonic biopsy using
an invasive procedure involving endoscopy. Specific examples of patents describing the
isolation of colonocytes include U.S. Patent Nos. 6,335,193; 6,020,137; 5,741,650;
6,258,541; US 2001/0026925 A1; and WO 00/63358 A1, the disclosures of which are
incorporated herein by reference in their entireties.

All the methods of the present invention may optionally include determining the
expression levels of one or more other cancer markers in addition to determining the
expression level of a CSNA or CSP. In many cases, the use of another cancer marker will
decrease the likelihood of false positives or false negatives. In one embodiment, the one
or more other cancer markers include other CSNAs or CSPs as disclosed herein. Other
cancer markers useful in the present invention will depend on the cancer being tested and
are known to those of skill in the art. In a preferred embodiment, at least one other cancer
marker in addition to a particular CSNA or CSP is measured. In a more preferred
embodiment, at least two other additional cancer markers are used. In an even more
preferred embodiment, at least three, more preferably at least five, even more preferably at
least ten additional cancer markers are used.

Diagnosing 

In one aspect, the invention provides a method for determining the expression
levels and/or structural alterations of one or more CSNA and/or CSP in a sample from a
patient suspected of having colon cancer. In general, the method comprises the steps of
obtaining the sample from the patient, determining the expression level or structural
alterations of a CSNA and/or CSP and then ascertaining whether the patient has colon
cancer from the expression level of the CSNA or CSP. In general, if high expression
relative to a control of a CSNA or CSP is indicative of colon cancer, a diagnostic assay is
considered positive if the level of expression of the CSNA or CSP is at least one and a half
times higher, and more preferably at least two times higher, still more preferably five
times higher, even more preferably at least ten times higher, than in preferably the same
cells, tissues or bodily fluid of a normal human control. In contrast, if low expression
relative to a control of a CSNA or CSP is indicative of colon cancer, a diagnostic assay is
considered positive if the level of expression of the CSNA or CSP is at least one and a half
times lower, and more preferably at least two times lower, still more preferably five
times lower, even more preferably at least ten times lower than in preferably the same
cells, tissues or bodily fluid of a normal human control. The normal human control may
be from a different patient or from uninvolved tissue of the same patient.
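
The fold-change criteria described above may be illustrated by the following minimal sketch. It assumes that the patient and control levels are positive values measured in the same units and sample type; the function name, parameter names and example values are hypothetical.

```python
def is_positive(patient_level, control_level, direction="up", min_fold=1.5):
    """Return True if the patient level differs from control by at least min_fold.

    direction="up" tests for higher expression in the patient;
    direction="down" tests for lower expression in the patient.
    """
    if patient_level <= 0 or control_level <= 0:
        raise ValueError("expression levels must be positive")
    if direction == "up":
        fold = patient_level / control_level
    elif direction == "down":
        fold = control_level / patient_level
    else:
        raise ValueError("direction must be 'up' or 'down'")
    return fold >= min_fold


# Hypothetical readings in arbitrary units.
result_default = is_positive(30.0, 10.0)               # 3-fold higher, 1.5x threshold
result_strict = is_positive(30.0, 10.0, min_fold=5.0)  # same data, 5x threshold
```

Raising `min_fold` to 2, 5 or 10 corresponds to the progressively more preferred thresholds recited above.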

The present invention also provides a method of determining whether colon cancer
has metastasized in a patient. One may identify whether the colon cancer has metastasized
by measuring the expression levels and/or structural alterations of one or more CSNAs
and/or CSPs in a variety of tissues. The presence of a CSNA or CSP in a tissue other than
colon at levels higher than that of corresponding noncancerous tissue (e.g., the same tissue
from another individual) is indicative of metastasis if high level expression of a CSNA or
CSP is associated with colon cancer. Similarly, the presence of a CSNA or CSP in a
tissue other than colon at levels lower than that of corresponding noncancerous tissue is
indicative of metastasis if low level expression of a CSNA or CSP is associated with colon
cancer. Further, the presence of a structurally altered CSNA or CSP that is associated
with colon cancer is also indicative of metastasis.

In general, if high expression relative to a control of a CSNA or CSP is indicative
of metastasis, an assay for metastasis is considered positive if the level of expression of
the CSNA or CSP is at least one and a half times higher, and more preferably at least
two times higher, still more preferably five times higher, even more preferably at least ten
times higher, than in preferably the same cells, tissues or bodily fluid of a normal human
control. In contrast, if low expression relative to a control of a CSNA or CSP is indicative
of metastasis, an assay for metastasis is considered positive if the level of expression of
the CSNA or CSP is at least one and a half times lower, and more preferably at least
two times lower, still more preferably five times lower, even more preferably at least ten
times lower than in preferably the same cells, tissues or bodily fluid of a normal human
control.



Staging

The invention also provides a method of staging colon cancer in a human patient.
The method comprises identifying a human patient having colon cancer and analyzing
cells, tissues or bodily fluids from such human patient for expression levels and/or
structural alterations of one or more CSNAs or CSPs. First, one or more tumors from a
variety of patients are staged according to procedures well known in the art, and the
expression levels of one or more CSNAs or CSPs are determined for each stage to obtain a
standard expression level for each CSNA and CSP. Then, the expression levels of the
CSNA or CSP are determined in a biological sample from a patient whose
stage of cancer is not known. The CSNA or CSP expression levels from the patient are
then compared to the standard expression level. By comparing the expression level of the
CSNAs and CSPs from the patient to the standard expression levels, one may determine
the stage of the tumor. The same procedure may be followed using structural alterations
of a CSNA or CSP to determine the stage of a colon cancer.
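
One way the comparison to stage-specific standard expression levels might be carried out is sketched below. The specification does not prescribe a particular comparison rule; the sketch assumes, purely for illustration, that the patient is assigned the stage whose standard levels are closest in squared Euclidean distance. The stage labels, marker names and values are hypothetical.

```python
def assign_stage(patient_levels, stage_standards):
    """Return the stage whose standard expression levels are nearest the patient's."""
    def squared_distance(standard):
        # Sum squared differences over every marker measured in the patient.
        return sum((patient_levels[m] - standard[m]) ** 2 for m in patient_levels)

    return min(stage_standards, key=lambda stage: squared_distance(stage_standards[stage]))


# Hypothetical standard expression levels per stage (arbitrary units).
standards = {
    "I": {"marker_a": 2.0, "marker_b": 1.0},
    "II": {"marker_a": 4.0, "marker_b": 2.5},
    "III": {"marker_a": 8.0, "marker_b": 5.0},
}
stage = assign_stage({"marker_a": 4.5, "marker_b": 2.0}, standards)
```

In this example the patient sample is closest to the stage II standard.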

Monitoring 

Further provided is a method of monitoring colon cancer in a human patient. One
may monitor a human patient to determine whether there has been metastasis and, if there
has been, when metastasis began to occur. One may also monitor a human patient to
determine whether a preneoplastic lesion has become cancerous. One may also monitor a
human patient to determine whether a therapy, e.g., chemotherapy, radiotherapy or
surgery, has decreased or eliminated the colon cancer. The monitoring may determine if
there has been a recurrence and, if so, determine its nature. The method comprises
identifying a human patient that one wants to monitor for colon cancer, periodically
analyzing cells, tissues or bodily fluids from such human patient for expression levels of
one or more CSNAs or CSPs, and comparing the CSNA or CSP levels over time to those
CSNA or CSP expression levels obtained previously. Patients may also be monitored by
measuring one or more structural alterations in a CSNA or CSP that are associated with
colon cancer.

If increased expression of a CSNA or CSP is associated with metastasis, treatment
failure, or conversion of a preneoplastic lesion to a cancerous lesion, then detecting an
increase in the expression level of a CSNA or CSP indicates that the tumor is
metastasizing, that treatment has failed or that the lesion is cancerous, respectively. One
having ordinary skill in the art would recognize that if this were the case, then a decreased
expression level would be indicative of no metastasis, effective therapy or failure to
progress to a neoplastic lesion. If decreased expression of a CSNA or CSP is associated
with metastasis, treatment failure, or conversion of a preneoplastic lesion to a cancerous
lesion, then detecting a decrease in the expression level of a CSNA or CSP indicates that
the tumor is metastasizing, that treatment has failed or that the lesion is cancerous,
respectively. In a preferred embodiment, the levels of CSNAs or CSPs are determined
from the same cell type, tissue or bodily fluid as prior patient samples. Monitoring a
patient for onset of colon cancer metastasis is periodic and preferably is done on a
quarterly basis, but may be done more or less frequently.
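
The periodic comparison described above may be sketched as follows. The sketch assumes that each sample is compared with the immediately preceding one and that a 1.5-fold change is treated as noteworthy; both choices, along with the function name and the example readings, are illustrative assumptions rather than requirements of the method.

```python
def flag_changes(series, direction="up", min_fold=1.5):
    """Return labels of samples whose level changed by min_fold versus the prior sample.

    series: chronological list of (label, level) pairs with positive levels.
    direction: "up" flags increases; "down" flags decreases.
    """
    if direction not in ("up", "down"):
        raise ValueError("direction must be 'up' or 'down'")
    if any(level <= 0 for _, level in series):
        raise ValueError("expression levels must be positive")
    flagged = []
    for (_, previous), (label, current) in zip(series, series[1:]):
        fold = current / previous if direction == "up" else previous / current
        if fold >= min_fold:
            flagged.append(label)
    return flagged


# Hypothetical quarterly readings for one marker (arbitrary units).
quarterly = [("Q1", 10.0), ("Q2", 11.0), ("Q3", 18.0), ("Q4", 17.0)]
increases = flag_changes(quarterly)
```

Here only the third-quarter sample is flagged, since it alone rises at least 1.5-fold over the preceding sample.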

The methods described herein can further be utilized as prognostic assays to
identify subjects having or at risk of developing a disease or disorder associated with
increased or decreased expression levels of a CSNA and/or CSP. The present invention
provides a method in which a test sample is obtained from a human patient and one or
more CSNAs and/or CSPs are detected. The presence of higher (or lower) CSNA or CSP
levels as compared to normal human controls is diagnostic for the human patient being at
risk for developing cancer, particularly colon cancer. The effectiveness of therapeutic
agents to decrease (or increase) expression or activity of one or more CSNAs and/or CSPs
of the invention can also be monitored by analyzing levels of expression of the CSNAs
and/or CSPs in a human patient in clinical trials or in in vitro screening assays such as in
human cells. In this way, the gene expression pattern can serve as a marker, indicative of
the physiological response of the human patient or cells, as the case may be, to the agent
being tested.

Detection of Genetic Lesions or Mutations 

The methods of the present invention can also be used to detect genetic lesions or
mutations in a CSG, thereby determining if a human with the genetic lesion is susceptible
to developing colon cancer or to determine what genetic lesions are responsible, or are
partly responsible, for a person's existing colon cancer. Genetic lesions can be detected,
for example, by ascertaining the existence of a deletion, insertion and/or substitution of
one or more nucleotides from the CSGs of this invention, a chromosomal rearrangement
of a CSG, an aberrant modification of a CSG (such as of the methylation pattern of the
genomic DNA), or allelic loss of a CSG. Methods to detect such lesions in the CSG of
this invention are known to those having ordinary skill in the art following the teachings
of the specification.

Methods of Detecting Noncancerous Colon Diseases

The present invention also provides methods for determining the expression levels
and/or structural alterations of one or more CSNAs and/or CSPs in a sample from a patient
suspected of having or known to have a noncancerous colon disease. In general, the
method comprises the steps of obtaining a sample from the patient, determining the
expression level or structural alterations of a CSNA and/or CSP, comparing the expression
level or structural alteration of the CSNA or CSP to a normal colon control, and then
ascertaining whether the patient has a noncancerous colon disease. In general, if high
expression relative to a control of a CSNA or CSP is indicative of a particular
noncancerous colon disease, a diagnostic assay is considered positive if the level of
expression of the CSNA or CSP is at least two times higher, and more preferably at
least five times higher, even more preferably at least ten times higher, than in preferably
the same cells, tissues or bodily fluid of a normal human control. In contrast, if low
expression relative to a control of a CSNA or CSP is indicative of a noncancerous colon
disease, a diagnostic assay is considered positive if the level of expression of the CSNA or
CSP is at least two times lower, more preferably at least five times lower, even more
preferably at least ten times lower than in preferably the same cells, tissues or bodily fluid
of a normal human control. The normal human control may be from a different patient or
from uninvolved tissue of the same patient.

One having ordinary skill in the art may determine whether a CSNA and/or CSP is
associated with a particular noncancerous colon disease by obtaining colon tissue from a
patient having a noncancerous colon disease of interest and determining which CSNAs
and/or CSPs are expressed in the tissue at either a higher or a lower level than in normal
colon tissue. In another embodiment, one may determine whether a CSNA or CSP
exhibits structural alterations in a particular noncancerous colon disease state by obtaining
colon tissue from a patient having a noncancerous colon disease of interest and
determining the structural alterations in one or more CSNAs and/or CSPs relative to
normal colon tissue.

Methods for Identifying Colon Tissue

In another aspect, the invention provides methods for identifying colon tissue.
These methods are particularly useful in, e.g., forensic science, colon cell differentiation
and development, and in tissue engineering.

In one embodiment, the invention provides a method for determining whether a
sample is colon tissue or has colon tissue-like characteristics. The method comprises the
steps of providing a sample suspected of comprising colon tissue or having colon
tissue-like characteristics, determining whether the sample expresses one or more CSNAs
and/or CSPs, and, if the sample expresses one or more CSNAs and/or CSPs, concluding
that the sample comprises colon tissue. In a preferred embodiment, the CSNA encodes a
polypeptide having an amino acid sequence selected from SEQ ID NO: 95-248, or a
homolog, allelic variant or fragment thereof. In a more preferred embodiment, the CSNA
has a nucleotide sequence selected from SEQ ID NO: 1-94, or a hybridizing nucleic acid,
an allelic variant or a part thereof. Determining whether a sample expresses a CSNA can
be accomplished by any method known in the art. Preferred methods include
hybridization to microarrays, Northern blot hybridization, and quantitative or qualitative
RT-PCR. In another preferred embodiment, the method can be practiced by determining
whether a CSP is expressed. Determining whether a sample expresses a CSP can be
accomplished by any method known in the art. Preferred methods include Western blot,
ELISA, RIA and 2D PAGE. In one embodiment, the CSP has an amino acid sequence
selected from SEQ ID NO: 95-248, or a homolog, allelic variant or fragment thereof. In
another preferred embodiment, the expression of at least two CSNAs and/or CSPs is
determined. In a more preferred embodiment, the expression of at least three, more
preferably four and even more preferably five CSNAs and/or CSPs is determined.
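
The decision step of the identification method may be sketched as a simple count of detected markers against a panel. The marker identifiers below are placeholders and do not correspond to any particular SEQ ID NO; the function name and the `min_markers` parameter are likewise hypothetical.

```python
def comprises_colon_tissue(expressed_markers, panel, min_markers=1):
    """Return True if at least min_markers panel members are expressed in the sample."""
    detected = set(expressed_markers) & set(panel)
    return len(detected) >= min_markers


# Placeholder identifiers for a panel of CSNAs and/or CSPs.
panel = {"marker_1", "marker_2", "marker_3", "marker_4", "marker_5"}
sample = {"marker_2", "marker_5", "unrelated_gene"}

call_any = comprises_colon_tissue(sample, panel)                    # one or more
call_three = comprises_colon_tissue(sample, panel, min_markers=3)   # at least three
```

Requiring more markers, as in the more preferred embodiments, trades sensitivity for a lower chance of a spurious call from a single marker.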

In one embodiment, the method can be used to determine whether an unknown
tissue is colon tissue. This is particularly useful in forensic science, in which small,
damaged pieces of tissues that are not identifiable by microscopic or other means are
recovered from a crime or accident scene. In another embodiment, the method can be
used to determine whether a tissue is differentiating or developing into colon tissue. This
is important in monitoring the effects of the addition of various agents to cell or tissue
culture, e.g., in producing new colon tissue by tissue engineering. These agents include,
e.g., growth and differentiation factors, extracellular matrix proteins and culture medium.
Other factors that may be measured for effects on tissue development and differentiation
include gene transfer into the cells or tissues, alterations in pH, aqueous:air interface and
various other culture conditions.

Methods for Producing and Modifying Colon Tissue

In another aspect, the invention provides methods for producing engineered colon
tissue or cells. In one embodiment, the method comprises the steps of providing cells,
introducing a CSNA or a CSG into the cells, and growing the cells under conditions in
which they exhibit one or more properties of colon tissue cells. In a preferred
embodiment, the cells are pluripotent. As is well known in the art, normal colon tissue
comprises a large number of different cell types. Thus, in one embodiment, the
engineered colon tissue or cells comprises one of these cell types. In another embodiment,
the engineered colon tissue or cells comprises more than one colon cell type. Further, the
culture conditions of the cells or tissue may require manipulation in order to achieve full
differentiation and development of the colon cell tissue. Methods for manipulating culture
conditions are well known in the art.

Nucleic acid molecules encoding one or more CSPs are introduced into cells,
preferably pluripotent cells. In a preferred embodiment, the nucleic acid molecules
encode CSPs having amino acid sequences selected from SEQ ID NO: 95-248, or
homologous proteins, analogs, allelic variants or fragments thereof. In a more preferred
embodiment, the nucleic acid molecules have a nucleotide sequence selected from SEQ ID
NO: 1-94, or hybridizing nucleic acids, allelic variants or parts thereof. In another highly
preferred embodiment, a CSG is introduced into the cells. Expression vectors and
methods of introducing nucleic acid molecules into cells are well known in the art and are
described in detail, supra.

Artificial colon tissue may be used to treat patients who have lost some or all of
their colon function.

Pharmaceutical Compositions 

In another aspect, the invention provides pharmaceutical compositions comprising
the nucleic acid molecules, polypeptides, fusion proteins, antibodies, antibody derivatives,
antibody fragments, agonists, antagonists, or inhibitors of the present invention. In a
preferred embodiment, the pharmaceutical composition comprises a CSNA or part thereof.
In a more preferred embodiment, the CSNA has a nucleotide sequence selected from the
group consisting of SEQ ID NO: 1-94, a nucleic acid that hybridizes thereto, an allelic
variant thereof, or a nucleic acid that has substantial sequence identity thereto. In another
preferred embodiment, the pharmaceutical composition comprises a CSP or fragment
thereof. In a more preferred embodiment, the pharmaceutical composition comprises a
CSP having an amino acid sequence that is selected from the group consisting of SEQ ID
NO: 95-248, a polypeptide that is homologous thereto, a fusion protein comprising all or a
portion of the polypeptide, or an analog or derivative thereof. In another preferred
embodiment, the pharmaceutical composition comprises an anti-CSP antibody, preferably
an antibody that specifically binds to a CSP having an amino acid sequence that is selected
from the group consisting of SEQ ID NO: 95-248, or an antibody that binds to a
polypeptide that is homologous thereto, a fusion protein comprising all or a portion of the
polypeptide, or an analog or derivative thereof.

Due to the association of angiogenesis with cancer vascularization, there is great
need of new markers and methods for diagnosing angiogenesis activity to identify
developing tumors and angiogenesis related diseases. Furthermore, great need is also
present for new molecular targets useful in the treatment of angiogenesis and angiogenesis
related diseases such as cancer. In addition, known modulators of angiogenesis, such as
endostatin or vascular endothelial growth factor (VEGF), may be used. Also contemplated
is use of the methods and compositions disclosed herein in combination with
anti-angiogenesis drugs, including drugs that block the matrix breakdown (such as
BMS-275291, Dalteparin (Fragmin®), Suramin), drugs that inhibit endothelial cells
(2-methoxyestradiol (2-ME), CC-5013 (Thalidomide Analog), Combretastatin A4
Phosphate, LY317615 (Protein Kinase C Beta Inhibitor), Soy Isoflavone (Genistein; Soy
Protein Isolate), Thalidomide), drugs that block activators of angiogenesis (AE-941
(Neovastat™; GW786034), Anti-VEGF Antibody (Bevacizumab; Avastin™),
Interferon-alpha, PTK787/ZK 222584, VEGF-Trap, ZD6474), and drugs that inhibit
endothelial-specific integrin/survival signaling (EMD 121974, Anti-αvβ3 Integrin
Antibody (Medi-522; Vitaxin™)).

Such a composition typically contains from about 0.1 to 90% by weight of a 
therapeutic agent of the invention formulated in and/or with a pharmaceutically acceptable 
carrier or excipient.

Pharmaceutical formulation is a well-established art that is further described in
Gennaro (ed.), Remington: The Science and Practice of Pharmacy, 20th ed., Lippincott,
Williams & Wilkins (2000); Ansel et al., Pharmaceutical Dosage Forms and Drug
Delivery Systems, 7th ed., Lippincott Williams & Wilkins (1999); and Kibbe (ed.),
Handbook of Pharmaceutical Excipients, American Pharmaceutical Association, 3rd ed.
(2000), and thus need not be described in detail herein.

Briefly, formulation of the pharmaceutical compositions of the present invention
will depend upon the route chosen for administration. The pharmaceutical compositions
utilized in this invention can be administered by various routes including both enteral and
parenteral routes, including oral, intravenous, intramuscular, subcutaneous, inhalation,
topical, sublingual, rectal, intra-arterial, intramedullary, intrathecal, intraventricular,
transmucosal, transdermal, intranasal, intraperitoneal, intrapulmonary, and intrauterine.

Oral dosage forms can be formulated as tablets, pills, dragees, capsules, liquids,
gels, syrups, slurries, suspensions, and the like, for ingestion by the patient.

Solid formulations of the compositions for oral administration can contain suitable
carriers or excipients, such as carbohydrate or protein fillers, such as sugars, including
lactose, sucrose, mannitol, or sorbitol; starch from corn, wheat, rice, potato, or other
plants; cellulose, such as methyl cellulose, hydroxypropylmethyl-cellulose, sodium
carboxymethylcellulose, or microcrystalline cellulose; gums including arabic and
tragacanth; proteins such as gelatin and collagen; inorganics, such as kaolin, calcium
carbonate, dicalcium phosphate, sodium chloride; and other agents such as acacia and
alginic acid.

Agents that facilitate disintegration and/or solubilization can be added, such as
cross-linked polyvinyl pyrrolidone, agar, alginic acid, or a salt thereof, such as sodium
alginate, microcrystalline cellulose, cornstarch, sodium starch glycolate, and alginic acid.

Tablet binders that can be used include acacia, methylcellulose, sodium
carboxymethylcellulose, polyvinylpyrrolidone (Povidone™), hydroxypropyl
methylcellulose, sucrose, starch and ethylcellulose.

Lubricants that can be used include magnesium stearates, stearic acid, silicone
fluid, talc, waxes, oils, and colloidal silica.

Fillers, agents that facilitate disintegration and/or solubilization, tablet binders and
lubricants, including the aforementioned, can be used singly or in combination.

Solid oral dosage forms need not be uniform throughout. For example, dragee
cores can be used in conjunction with suitable coatings, such as concentrated sugar
solutions, which can also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel,
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic
solvents or solvent mixtures.

Oral dosage forms of the present invention include push-fit capsules made of
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or
sorbitol. Push-fit capsules can contain active ingredients mixed with a filler or binders,
such as lactose or starches, lubricants, such as talc or magnesium stearate, and, optionally,
stabilizers. In soft capsules, the active compounds can be dissolved or suspended in
suitable liquids, such as fatty oils, liquid, or liquid polyethylene glycol with or without
stabilizers.

Additionally, dyestuffs or pigments can be added to the tablets or dragee coatings
for product identification or to characterize the quantity of active compound, i.e., dosage.

Liquid formulations of the pharmaceutical compositions for oral (enteral)
administration are prepared in water or other aqueous vehicles and can contain various
suspending agents such as methylcellulose, alginates, tragacanth, pectin, kelgin,
carrageenan, acacia, polyvinylpyrrolidone, and polyvinyl alcohol. The liquid formulations
can also include solutions, emulsions, syrups and elixirs containing, together with the
active compound(s), wetting agents, sweeteners, and coloring and flavoring agents.

The pharmaceutical compositions of the present invention can also be formulated 
for parenteral administration. Formulations for parenteral administration can be in the 
form of aqueous or non-aqueous isotonic sterile injection solutions or suspensions. 

For intravenous injection, water soluble versions of the compounds of the present
invention are formulated in, or if provided as a lyophilate, mixed with, a physiologically
acceptable fluid vehicle, such as 5% dextrose ("D5"), physiologically buffered saline,
0.9% saline, Hanks' solution, or Ringer's solution. Intravenous formulations may include
carriers, excipients or stabilizers including, without limitation, calcium, human serum
albumin, citrate, acetate, calcium chloride, carbonate, and other salts.

Intramuscular preparations, e.g., a sterile formulation of a suitable soluble salt form
of the compounds of the present invention, can be dissolved and administered in a
pharmaceutical excipient such as Water-for-Injection, 0.9% saline, or 5% glucose solution.
Alternatively, a suitable insoluble form of the compound can be prepared and
administered as a suspension in an aqueous base or a pharmaceutically acceptable oil base,
such as an ester of a long chain fatty acid (e.g., ethyl oleate), fatty oils such as sesame oil,
triglycerides, or liposomes.

Parenteral formulations of the compositions can contain various carriers such as
vegetable oils, dimethylacetamide, dimethylformamide, ethyl lactate, ethyl carbonate,
isopropyl myristate, ethanol, polyols (glycerol, propylene glycol, liquid polyethylene
glycol, and the like).

Aqueous injection suspensions can also contain substances that increase the
viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran.
Non-lipid polycationic amino polymers can also be used for delivery. Optionally, the
suspension can also contain suitable stabilizers or agents that increase the solubility of the
compounds to allow for the preparation of highly concentrated solutions.

Pharmaceutical compositions of the present invention can also be formulated to
permit injectable, long-term deposition. Injectable depot forms may be made by forming
microencapsulated matrices of the compound in biodegradable polymers such as
polylactide-polyglycolide. Depending upon the ratio of drug to polymer and the nature of
the particular polymer employed, the rate of drug release can be controlled. Examples of
other biodegradable polymers include poly(orthoesters) and poly(anhydrides). Depot
injectable formulations are also prepared by entrapping the drug in microemulsions that
are compatible with body tissues.

The pharmaceutical compositions of the present invention can be administered
topically. For topical use the compounds of the present invention can also be prepared in
suitable forms to be applied to the skin, or mucous membranes of the nose and throat, and
can take the form of lotions, creams, ointments, liquid sprays or inhalants, drops, tinctures,
lozenges, or throat paints. Such topical formulations further can include chemical
compounds such as dimethylsulfoxide (DMSO) to facilitate surface penetration of the
active ingredient. In other transdermal formulations, typically in patch-delivered
formulations, the pharmaceutically active compound is formulated with one or more skin
penetrants, such as 2-N-methyl-pyrrolidone (NMP) or Azone. A topical semi-solid
ointment formulation typically contains a concentration of the active ingredient from
about 1 to 20%, e.g., 5 to 10%, in a carrier such as a pharmaceutical cream base.

For application to the eyes or ears, the compounds of the present invention can be 
presented in liquid or semi-liquid form formulated in hydrophobic or hydrophilic bases as 
ointments, creams, lotions, paints or powders. 

For rectal administration the compounds of the present invention can be
administered in the form of suppositories admixed with conventional carriers such as
cocoa butter, wax or other glyceride.

Inhalation formulations can also readily be formulated. For inhalation, various
powder and liquid formulations can be prepared. For aerosol preparations, a sterile
formulation of the compound or salt form of the compound may be used in inhalers, such
as metered dose inhalers, and nebulizers. Aerosolized forms may be especially useful for
treating respiratory disorders.

Alternatively, the compounds of the present invention can be in powder form for 
reconstitution in the appropriate pharmaceutically acceptable carrier at the time of 
delivery. 

The pharmaceutically active compound in the pharmaceutical compositions of the
present invention can be provided as the salt of a variety of acids, including but not limited
to hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and succinic acid. Salts tend to be
more soluble in aqueous or other protonic solvents than are the corresponding free base
forms.

After pharmaceutical compositions have been prepared, they are packaged in an
appropriate container and labeled for treatment of an indicated condition.

The active compound will be present in an amount effective to achieve the
intended purpose. The determination of an effective dose is well within the capability of
those skilled in the art.

A "therapeutically effective dose" refers to that amount of active ingredient, for
example CSP polypeptide, fusion protein, or fragments thereof, antibodies specific for
CSP, agonists, antagonists or inhibitors of CSP, which ameliorates the signs or symptoms
of the disease or prevents progression thereof; as would be understood in the medical arts,
cure, although desired, is not required.

The therapeutically effective dose of the pharmaceutical agents of the present
invention can be estimated initially by in vitro tests, such as cell culture assays, followed
by assay in model animals, usually mice, rats, rabbits, dogs, or pigs. The animal model
can also be used to determine an initial preferred concentration range and route of
administration.

For example, the ED50 (the dose therapeutically effective in 50% of the
population) and LD50 (the dose lethal to 50% of the population) can be determined in one
or more cell culture or animal model systems. The dose ratio of toxic to therapeutic
effects is the therapeutic index, which can be expressed as LD50/ED50. Pharmaceutical
compositions that exhibit large therapeutic indices are preferred.

The data obtained from cell culture assays and animal studies are used in
formulating an initial dosage range for human use, and preferably provide a range of
circulating concentrations that includes the ED50 with little or no toxicity. After
administration, or between successive administrations, the circulating concentration of
active agent varies within this range depending upon pharmacokinetic factors well known
in the art, such as the dosage form employed, sensitivity of the patient, and the route of
administration.
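
The therapeutic index described above is a simple ratio, and a minimal sketch of the arithmetic may be helpful. The function name and the numeric values below are hypothetical and chosen only for illustration; they are not data from this disclosure.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return the ratio LD50/ED50. Both doses must be in the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical (LD50, ED50) pairs in mg/kg, for illustration only.
candidates = {"agent_a": (500.0, 5.0), "agent_b": (120.0, 10.0)}

# Compositions with larger therapeutic indices are preferred, so rank descending.
ranked = sorted(
    candidates,
    key=lambda name: therapeutic_index(*candidates[name]),
    reverse=True,
)
```

Under these assumed values, agent_a (index 100) would rank ahead of agent_b (index 12).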

The exact dosage will be determined by the practitioner, in light of factors specific
to the subject requiring treatment. Factors that can be taken into account by the
practitioner include the severity of the disease state, general health of the subject, age,
weight, gender of the subject, diet, time and frequency of administration, drug
combination(s), reaction sensitivities, and tolerance/response to therapy. Long-acting
pharmaceutical compositions can be administered every 3 to 4 days, every week, or once
every two weeks depending on half-life and clearance rate of the particular formulation.

Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up to a total
dose of about 1 g, depending upon the route of administration. Where the therapeutic
agent is a protein or antibody of the present invention, the therapeutic protein or antibody
agent typically is administered at a daily dosage of 0.01 mg to 30 mg/kg of body weight of
the patient (e.g., 1 mg/kg to 5 mg/kg). The pharmaceutical formulation can be
administered in multiple doses per day, if desired, to achieve the total desired daily dose.

Guidance as to particular dosages and methods of delivery is provided in the
literature and generally available to practitioners in the art. Those skilled in the art will
employ different formulations for nucleotides than for proteins or their inhibitors.
Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells,
conditions, locations, etc.
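
The weight-based daily dosage described above for protein or antibody agents can be illustrated with a short sketch. This is arithmetic only, not dosing guidance; the function names are hypothetical, and the range check assumes that both ends of the stated range (0.01 to 30) are in mg per kg of body weight.

```python
def daily_dose_mg(weight_kg: float, dose_mg_per_kg: float) -> float:
    """Total daily dose in mg for a weight-based regimen."""
    if weight_kg <= 0 or dose_mg_per_kg <= 0:
        raise ValueError("weight and dose rate must be positive")
    return weight_kg * dose_mg_per_kg


def in_typical_range(dose_mg_per_kg: float, low: float = 0.01, high: float = 30.0) -> bool:
    """Check a dose rate against the typical range (assumed to be mg/kg at both ends)."""
    return low <= dose_mg_per_kg <= high


def split_daily_dose(total_mg: float, doses_per_day: int) -> list[float]:
    """Divide the total daily dose evenly across several administrations."""
    if doses_per_day < 1:
        raise ValueError("doses_per_day must be at least 1")
    return [total_mg / doses_per_day] * doses_per_day
```

For a hypothetical 70 kg subject at 1 mg/kg, the total daily dose would be 70 mg, which could be given as two administrations of 35 mg each.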

Conventional methods, known to those of ordinary skill in the art of medicine, can 
be used to administer the pharmaceutical formulation(s) of the present invention to the 
patient. The pharmaceutical compositions of the present invention can be administered 
alone, or in combination with other therapeutic agents or interventions. 






Therapeutic Methods 

The present invention further provides methods of treating subjects having defects
in a gene of the invention, e.g., in expression, activity, distribution, localization, and/or
solubility, which can manifest as a disorder of colon function. As used herein, "treating"
includes all medically-acceptable types of therapeutic intervention, including palliation
and prophylaxis (prevention) of disease. The term "treating" encompasses any
improvement of a disease, including minor improvements. These methods are discussed
below.

Gene Therapy and Vaccines 

The isolated nucleic acids of the present invention can also be used to drive in vivo
expression of the polypeptides of the present invention. In vivo expression can be driven
from a vector, typically a viral vector, often a vector based upon a replication incompetent
retrovirus, an adenovirus, or an adeno-associated virus (AAV), for the purpose of gene
therapy. In vivo expression can also be driven from signals endogenous to the nucleic acid
or from a vector, often a plasmid vector, such as pVAX1 (Invitrogen, Carlsbad, CA,
USA), for the purpose of "naked" nucleic acid vaccination, as further described in U.S.
Patent Nos. 5,589,466; 5,679,647; 5,804,566; 5,830,877; 5,843,913; 5,880,104; 5,958,891;
5,985,847; 6,017,897; 6,110,898; 6,204,250, the disclosures of which are incorporated
herein by reference in their entireties. For cancer therapy, it is preferred that the vector
also be tumor-selective. See, e.g., Doronin et al., J. Virol. 75: 3314-24 (2001).

In another embodiment of the therapeutic methods of the present invention, a
therapeutically effective amount of a pharmaceutical composition comprising a nucleic
acid molecule of the present invention is administered. The nucleic acid molecule can be
delivered in a vector that drives expression of a CSP, fusion protein, or fragment thereof,
or without such vector. Nucleic acid compositions that can drive expression of a CSP are
administered, for example, to complement a deficiency in the native CSP, or as DNA
vaccines. Expression vectors derived from virus, replication deficient retroviruses,
adenovirus, adeno-associated (AAV) virus, herpes virus, or vaccinia virus can be used as
can plasmids. See, e.g., Cid-Arregui, supra. In a preferred embodiment, the nucleic acid
molecule encodes a CSP having the amino acid sequence of SEQ ID NO: 95-248, or a
fragment, fusion protein, allelic variant or homolog thereof.

In still other therapeutic methods of the present invention, pharmaceutical
compositions comprising host cells that express a CSP, fusions, or fragments thereof can
be administered. In such cases, the cells are typically autologous, so as to circumvent
xenogeneic or allotypic rejection, and are administered to complement defects in CSP
production or activity. In a preferred embodiment, the nucleic acid molecules in the cells
encode a CSP having the amino acid sequence of SEQ ID NO: 95-248, or a fragment,
fusion protein, allelic variant or homolog thereof.

Antisense Administration 

Antisense nucleic acid compositions, or vectors that drive expression of a CSG
antisense nucleic acid, are administered to downregulate transcription and/or translation of
a CSG in circumstances in which excessive production, or production of aberrant protein,
is the pathophysiologic basis of disease.

Antisense compositions useful in therapy can have a sequence that is
complementary to coding or to noncoding regions of a CSG. For example,
oligonucleotides derived from the transcription initiation site, e.g., between positions -10
and +10 from the start site, are preferred.
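
As a rough illustration of how an antisense oligonucleotide spanning the start site could be derived in silico, the sketch below takes the reverse complement of a window around a given start position. The function names, the example sequence, and the indexing convention (the window covers the 10 bases upstream of the start base plus the start base and the following 9 bases, 20 nt in total) are assumptions made for this example, not part of the disclosure.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def antisense_around_start(sense: str, start_index: int,
                           upstream: int = 10, downstream: int = 10) -> str:
    """Antisense oligo covering positions -upstream..+downstream of the start site.

    sense:        sense-strand DNA, 5' to 3'
    start_index:  0-based index of the +1 base (assumed convention)
    """
    lo = start_index - upstream
    hi = start_index + downstream
    if lo < 0 or hi > len(sense):
        raise ValueError("window extends beyond the supplied sequence")
    return reverse_complement(sense[lo:hi])


# Hypothetical sense-strand fragment; the +1 base is at index 10.
example_sense = "A" * 10 + "C" * 10 + "G" * 5
example_oligo = antisense_around_start(example_sense, start_index=10)
```

In practice, candidate oligonucleotides would also be screened for specificity and secondary structure, which this sketch does not attempt.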

Catalytic antisense compositions, such as ribozymes, that are capable of
sequence-specific hybridization to CSG transcripts, are also useful in therapy. See, e.g.,
Phylactou, Adv. Drug Deliv. Rev. 44(2-3): 97-108 (2000); Phylactou et al., Hum. Mol.
Genet. 7(10): 1649-53 (1998); Rossi, Ciba Found. Symp. 209: 195-204 (1997); and
Sigurdsson et al., Trends Biotechnol. 13(8): 286-9 (1995).

Other nucleic acids useful in the therapeutic methods of the present invention are
those that are capable of triplex helix formation in or near the CSG genomic locus. Such
triplexing oligonucleotides are able to inhibit transcription. See, e.g., Intody et al., Nucleic
Acids Res. 28(21): 4283-90 (2000); and McGuffie et al., Cancer Res. 60(14): 3790-9
(2000). Pharmaceutical compositions comprising such triplex forming oligos (TFOs) are
administered in circumstances in which excessive production, or production of aberrant
protein, is a pathophysiologic basis of disease.

In a preferred embodiment, the antisense molecule is derived from a nucleic acid
molecule encoding a CSP, preferably a CSP comprising an amino acid sequence of SEQ
ID NO: 95-248, or a fragment, allelic variant or homolog thereof. In a more preferred
embodiment, the antisense molecule is derived from a nucleic acid molecule having a
nucleotide sequence of SEQ ID NO: 1-94, or a part, allelic variant, substantially similar or
hybridizing nucleic acid thereof.



Polypeptide Administration

In one embodiment of the therapeutic methods of the present invention, a
therapeutically effective amount of a pharmaceutical composition comprising a CSP, a
fusion protein, fragment, analog or derivative thereof is administered to a subject with a
clinically-significant CSP defect.

Protein compositions are administered, for example, to complement a deficiency in
native CSP. In other embodiments, protein compositions are administered as a vaccine to
elicit a humoral and/or cellular immune response to CSP. The immune response can be
used to modulate activity of CSP or, depending on the immunogen, to immunize against
aberrant or aberrantly expressed forms, such as mutant or inappropriately expressed
isoforms. In yet other embodiments, protein fusions having a toxic moiety are
administered to ablate cells that aberrantly accumulate CSP.

In a preferred embodiment, the polypeptide administered is a CSP comprising an
amino acid sequence of SEQ ID NO: 95-248, or a fusion protein, allelic variant, homolog,
analog or derivative thereof. In a more preferred embodiment, the polypeptide is encoded
by a nucleic acid molecule having a nucleotide sequence of SEQ ID NO: 1-94, or a part,
allelic variant, substantially similar or hybridizing nucleic acid thereof.

Antibody, Agonist and Antagonist Administration 

In another embodiment of the therapeutic methods of the present invention, a
therapeutically effective amount of a pharmaceutical composition comprising an antibody
(including fragment or derivative thereof) of the present invention is administered. As is
well known, antibody compositions are administered, for example, to antagonize activity
of CSP, or to target therapeutic agents to sites of CSP presence and/or accumulation. In a
preferred embodiment, the antibody specifically binds to a CSP comprising an amino acid
sequence of SEQ ID NO: 95-248, or a fusion protein, allelic variant, homolog, analog or
derivative thereof. In a more preferred embodiment, the antibody specifically binds to a
CSP encoded by a nucleic acid molecule having a nucleotide sequence of SEQ ID NO:
1-94, or a part, allelic variant, substantially similar or hybridizing nucleic acid thereof.

The present invention also provides methods for identifying modulators which
bind to a CSP or have a modulatory effect on the expression or activity of a CSP.
Modulators which decrease the expression or activity of CSP (antagonists) are believed to
be useful in treating colon cancer. Such screening assays are known to those of skill in the
art and include, without limitation, cell-based assays and cell-free assays. Small molecules
predicted via computer imaging to specifically bind to regions of a CSP can also be
designed, synthesized and tested for use in the imaging and treatment of colon cancer.
Further, libraries of molecules can be screened for potential anticancer agents by assessing
the ability of the molecule to bind to the CSPs identified herein. Molecules identified in
the library as being capable of binding to a CSP are key candidates for further evaluation
for use in the treatment of colon cancer. In a preferred embodiment, these molecules will
downregulate expression and/or activity of a CSP in cells.

In another embodiment of the therapeutic methods of the present invention, a
pharmaceutical composition comprising a non-antibody antagonist of CSP is administered.
Antagonists of CSP can be produced using methods generally known in the art. In
particular, purified CSP can be used to screen libraries of pharmaceutical agents, often
combinatorial libraries of small molecules, to identify those that specifically bind and
antagonize at least one activity of a CSP.

In other embodiments a pharmaceutical composition comprising an agonist of a
CSP is administered. Agonists can be identified using methods analogous to those used to
identify antagonists.

In a preferred embodiment, the antagonist or agonist specifically binds to and
antagonizes or agonizes, respectively, a CSP comprising an amino acid sequence of SEQ
ID NO: 95-248, or a fusion protein, allelic variant, homolog, analog or derivative thereof.
In a more preferred embodiment, the antagonist or agonist specifically binds to and
antagonizes or agonizes, respectively, a CSP encoded by a nucleic acid molecule having a
nucleotide sequence of SEQ ID NO: 1-94, or a part, allelic variant, substantially similar or
hybridizing nucleic acid thereof.

Targeting Colon Tissue 

The invention also provides a method in which a polypeptide of the invention, or 
an antibody thereto, is linked to a therapeutic agent such that it can be delivered to the 
colon or to specific cells in the colon. In a preferred embodiment, an anti-CSP antibody is 
linked to a therapeutic agent and is administered to a patient in need of such therapeutic 
agent. The therapeutic agent may be a toxin, if colon tissue needs to be selectively 
destroyed. This would be useful for targeting and killing colon cancer cells. In another 
embodiment, the therapeutic agent may be a growth or differentiation factor, which would 
be useful for promoting colon cell function.

In another embodiment, an anti-CSP antibody may be linked to an imaging agent
that can be detected using, e.g., magnetic resonance imaging, CT or PET. This would be 
useful for determining and monitoring colon function, identifying colon cancer tumors, 
and identifying noncancerous colon diseases. 

EXAMPLES

Example 1a: Alternative Splice Variants

We identified gene transcripts using the Gencarta™ tools (Compugen Ltd., Tel
Aviv, Israel) and a variety of public and proprietary databases. These splice variants are
either sequences which differ from a previously defined sequence or new uses of known
sequences. In general, related variants are annotated as DEX0450_XXX.nt.1,
DEX0450_XXX.nt.2, DEX0450_XXX.nt.3, etc. The variant DNA sequences encode
proteins which differ from a previously defined protein sequence. In relation to the
nucleotide sequence naming convention, protein variants are annotated as
DEX0450_XXX.aa.1, DEX0450_XXX.aa.2, etc., wherein transcript DEX0450_XXX.nt.1
encodes protein DEX0450_XXX.aa.1. A single transcript may encode a protein from an
alternate Open Reading Frame (ORF), which is designated DEX0450_XXX.orf.1.
Additionally, multiple transcripts may encode a single protein. In this case,
DEX0450_XXX.nt.1 and DEX0450_XXX.nt.2 will both be associated with
DEX0450_XXX.aa.1.
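
The naming convention above lends itself to simple programmatic handling. The following is a minimal sketch, assuming that XXX is always a three-digit number and that the trailing index is a positive integer; the function names and the identifiers using "999" are hypothetical placeholders, not entries from the tables of this disclosure.

```python
import re
from collections import defaultdict

# DEX0450_XXX.<kind>.<n>, where kind is nt (transcript), aa (protein) or orf (alternate ORF).
DEX_ID = re.compile(r"^DEX0450_(\d{3})\.(nt|aa|orf)\.(\d+)$")


def parse_dex_id(dex_id: str) -> dict:
    """Split a DEX ID into its gene number, sequence kind and variant index."""
    match = DEX_ID.match(dex_id)
    if match is None:
        raise ValueError(f"not a recognised DEX ID: {dex_id!r}")
    gene, kind, index = match.groups()
    return {"gene": gene, "kind": kind, "index": int(index)}


def group_transcripts_by_protein(pairs):
    """Map each protein ID to the transcripts associated with it.

    pairs: iterable of (transcript_id, protein_id) tuples.
    """
    proteins = defaultdict(list)
    for transcript_id, protein_id in pairs:
        if parse_dex_id(transcript_id)["kind"] != "nt":
            raise ValueError(f"expected a transcript ID: {transcript_id!r}")
        if parse_dex_id(protein_id)["kind"] not in ("aa", "orf"):
            raise ValueError(f"expected a protein ID: {protein_id!r}")
        proteins[protein_id].append(transcript_id)
    return dict(proteins)
```

For instance, grouping the hypothetical pairs (DEX0450_999.nt.1, DEX0450_999.aa.1) and (DEX0450_999.nt.2, DEX0450_999.aa.1) would associate both transcripts with the single protein DEX0450_999.aa.1, mirroring the case in which multiple transcripts encode one protein.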

The mapping of the nucleic acid ("NT") SEQ ID NO, DEX ID, chromosomal
location (if known), open reading frame (ORF) location, amino acid ("AA") SEQ ID NO,
and AA DEX ID is shown in the table below.

NT SEQ ID NO | NT DEX ID        | Chromosomal location  | ORF location | AA SEQ ID NO | AA DEX ID
             |                  | 2p15                  | 4360-6255    | 226          | DEX0450_043.orf.1
82           | DEX0450_044.nt.1 | X;150551654-150555252 | 82-562       | 227          | DEX0450_044.aa.1
83           | DEX0450_044.nt.2 | X;150551654-150575564 | 1038-1976    | 228          | DEX0450_044.aa.2
84           | DEX0450_044.nt.3 | X;150551654-150575855 | 409-1311     | 229          | DEX0450_044.aa.3
85           | DEX0450_045.nt.1 | 12q13.2               | 1-299        | 230          | DEX0450_045.aa.1
85           | DEX0450_045.nt.1 | 12q13.2               | 4-393        | 231          | DEX0450_045.orf.1
86           | DEX0450_046.nt.1 | 17q25.3               | 3-786        | 232          | DEX0450_046.aa.1
86           | DEX0450_046.nt.1 | 17q25.3               | 35-763       | 233          | DEX0450_046.orf.1
87           | DEX0450_047.nt.1 | 19p13.3               | 88-1375      | 234          | DEX0450_047.aa.1
87           | DEX0450_047.nt.1 | 19p13.3               | 487-1347     | 235          | DEX0450_047.orf.1
88           | DEX0450_048.nt.1 | 2p13.2                | 22-766       | 236          | DEX0450_048.aa.1
88           | DEX0450_048.nt.1 | 2p13.2                | 290-751      | 237          | DEX0450_048.orf.1
89           | DEX0450_049.nt.1 | 7q22.1                | 139-944      | 238          | DEX0450_049.aa.1
89           | DEX0450_049.nt.1 | 7q22.1                | 505-918      | 239          | DEX0450_049.orf.1
90           | DEX0450_050.nt.1 | 8q22.1                | 268-970      | 240          | DEX0450_050.aa.1
90           | DEX0450_050.nt.1 | 8q22.1                | 15-971       | 241          | DEX0450_050.orf.1
91           | DEX0450_051.nt.1 | 8p23.1                | 58-287       | 242          | DEX0450_051.aa.1
91           | DEX0450_051.nt.1 | 8p23.1                | 2-217        | 243          | DEX0450_051.orf.1
92           | DEX0450_051.nt.2 | 8p23.1                | 49-235       | 244          | DEX0450_051.aa.2
92           | DEX0450_051.nt.2 | 8p23.1                | 63-233       | 245          | DEX0450_051.orf.2
93           | DEX0450_052.nt.1 | 9q34.11               | 1-797        | 246          | DEX0450_052.aa.1
93           | DEX0450_052.nt.1 | 9q34.11               | 322-765      | 247          | DEX0450_052.orf.1
94           | DEX0450_053.nt.1 | 8q22.1                | 244-945      | 248          | DEX0450_053.aa.1



The polypeptides of the present invention were analyzed and the following
attributes were identified: epitopes, post-translational modifications, signal
peptides and transmembrane domains. Antigenicity (epitope) prediction was performed
through the antigenic module in the EMBOSS package. Rice, P., EMBOSS: The
European Molecular Biology Open Software Suite, Trends in Genetics 16(6): 276-277
(2000). The antigenic module predicts potentially antigenic regions of a protein sequence,
using the method of Kolaskar and Tongaonkar. Kolaskar, AS and Tongaonkar, PC, A
semi-empirical method for prediction of antigenic determinants on protein antigens, FEBS
Letters 276: 172-174 (1990). Examples of post-translational modifications (PTMs) and
other motifs of the CSPs of this invention are listed below. In addition, antibodies that
specifically bind such post-translational modifications may be useful as a diagnostic or as
a therapeutic. The motifs of the CSPs of the invention, including PTMs, were predicted by
using the ProSite Dictionary of Protein Sites and Patterns (Bairoch et al., Nucleic Acids
Res. 25(1):217-221 (1997)). The signal peptides were detected by using SignalP 2.0; see
Nielsen et al., Protein Engineering 12, 3-9 (1999). Prediction of transmembrane helices in
proteins was performed by the application TMHMM 2.0, "currently the best performing
transmembrane prediction program", according to its authors (Krogh et al., Journal of
Molecular Biology, 305(3):567-580 (2001); Moller et al., Bioinformatics, 17(7):646-653
(2001); Sonnhammer et al., A hidden Markov model for predicting transmembrane helices
in protein sequences, in Glasgow et al., eds., Proceedings of the Sixth International
Conference on Intelligent Systems for Molecular Biology, pages 175-182, Menlo Park,
CA, 1998, AAAI Press). The PSORT II program may also be used to predict cellular
localizations. Horton et al., Intelligent Systems for Molecular Biology 5: 147-152 (1997).

The table below includes the following sequence annotations: Signal peptide presence;
TM (number of membrane domains, topology in orientation and position); amino acid
location and antigenic index (location, AI score); PTM and other motifs (type, amino acid
residue locations); and functional domains (type, amino acid residue locations).

Example 1b: Sequence Alignment Support

Alignments between previously identified sequences and splice variant sequences
are performed to confirm unique portions of splice variant nucleic acid and amino acid
sequences. The alignments are done using the Needle program in the European Molecular
Biology Open Software Suite (EMBOSS) version 2.2.0 available at www.emboss.org
from EMBnet (http://www.embnet.org). Default settings are used unless otherwise noted.
The Needle program in EMBOSS implements the Needleman-Wunsch algorithm.
Needleman, S. B., Wunsch, C. D., J. Mol. Biol. 48:443-453 (1970).
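
For readers unfamiliar with the Needleman-Wunsch algorithm, the following is a minimal sketch of its dynamic-programming recurrence for a global alignment score. It assumes a simple match/mismatch score and a linear gap penalty, chosen here for brevity; it does not reproduce the substitution matrices or gap parameters of the EMBOSS Needle program, and its scores are not comparable to Needle output.

```python
def needleman_wunsch_score(a: str, b: str,
                           match: int = 1, mismatch: int = -1, gap: int = -1) -> int:
    """Optimal global alignment score of sequences a and b.

    Uses a linear gap penalty (every gapped position costs `gap`).
    Only two rows of the scoring matrix are kept in memory.
    """
    # Row 0: aligning the empty prefix of a against prefixes of b costs only gaps.
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # column 0: prefix of a aligned against nothing
        for j in range(1, len(b) + 1):
            diagonal = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap        # gap in b
            left = curr[j - 1] + gap  # gap in a
            curr.append(max(diagonal, up, left))
        prev = curr
    return prev[-1]
```

With these assumed parameters, two identical sequences of length n score n, and a sequence aligned against a copy of itself lacking one residue scores the number of matched residues minus one gap. A traceback step, omitted here, would be needed to recover the alignment itself and hence the unique regions of a splice variant.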

It is well known to those skilled in the art that implementation of alignment algorithms
by various programs may result in minor changes in the generated output. These changes
include but are not limited to: alignment scores (percent identity, similarity, and gap),
display of nonaligned flanking sequence regions, and number assignment to residues.
These minor changes in the output of an alignment do not alter the physical characteristics
of the sequences or the differences between the sequences, e.g., regions of homology,
insertions, or deletions.

Example 1c: RT-PCR Analysis

To detect the presence and tissue distribution of a particular splice variant, Reverse
Transcription-Polymerase Chain Reaction (RT-PCR) is performed using cDNA generated
from a panel of tissue RNAs. See, e.g., Sambrook et al., Molecular Cloning: A Laboratory
Manual, 2d ed., Cold Spring Harbor Laboratory Press (1989); and Kawasaki ES et al.,
PNAS 85(15):5698 (1988). Total RNA is extracted from a variety of tissues and first
strand cDNA is prepared with reverse transcriptase (RT). Each panel includes 23 cDNAs
from five cancer types (lung, ovary, breast, colon, and prostate) and normal samples of
testis, placenta and fetal brain. Each cancer set is composed of three cancer cDNAs from
different donors and one normal pooled sample. Using a standard enzyme kit from BD
Bioscience Clontech (Mountain View, CA), the target transcript is detected with
sequence-specific primers designed to amplify only the particular splice variant. The PCR
reaction is run on the GeneAmp PCR system 9700 (Applied Biosystems, Foster City, CA)
thermocycler under optimal conditions. One of ordinary skill can design appropriate
primers and determine optimal conditions. The amplified product is resolved on an
agarose gel to detect a band of equivalent size to the predicted RT-PCR product. A band
indicates the presence of the splice variant in a sample. The relation of the amplified
product to the splice variant is subsequently confirmed by DNA sequencing.

After subcloning, all positively screened clones are sequence verified. The DNA
sequence verification results show the splice variant contains the predicted sequence
differences in comparison with the reference sequence.

Results for RT-PCR analysis in the table below include the sequence DEX ID, 
Lead Name, Cancer Tissue(s) the transcript was detected in, Normal Tissue(s) the 
transcript was detected in, the predicted length of the RT-PCR product, and the Confirmed 
Length of the RT-PCR product. 



DEX ID | Lead Name | Cancer Tissue(s) | Normal Tissue(s) | Predicted Length | Confirmed Length
DEX0450_007.nt.1 | Cln260 | Prostate | Ovary | 365bp | 365bp

RT-PCR results confirm the presence of SEQ ID NO: 1-94 in biologic samples and
distinguish between related transcripts.

Example 1d: Secretion Assay

To determine if a protein encoded by a splice variant is secreted from cells, a
secretion assay is performed. A pcDNA3.1 clone containing the gene transcript which
encodes the variant protein is transfected into 293T cells using the Superfect transfection
reagent (Qiagen, Valencia CA). Transfected cells are incubated for 28 hours before the
media is collected and immediately spun down to remove any detached cells. The
adherent cells are solubilized with lysis buffer (1% NP40, 10mM sodium phosphate
pH7.0, and 0.15M NaCl). The lysed cells are collected and spun down and the
supernatant extracted as cell lysate. Western immunoblot is carried out in the following
manner: 15µl of the cell lysate and media are run on 4-12% NuPage Bis-Tris gel
(Invitrogen, Carlsbad CA), and blotted onto a PVDF membrane (Invitrogen, Carlsbad
CA). The blot is incubated with a polyclonal primary antibody which binds to the variant
protein (Imgenex, San Diego CA) and polyclonal goat anti-rabbit-peroxidase secondary
antibody (Sigma-Aldrich, St. Louis MO). The blot is developed with the ECL Plus
chemiluminescent detection reagent (Amersham Biosciences, Piscataway NJ).

Secretion assay results are indicative of SEQ ID NO: 95-248 being a diagnostic 
marker and/or therapeutic target for cancer. 

Example 2a: Gene Expression Analysis
Custom Microarray Experiment - Cancer

Custom oligonucleotide microarrays were provided by Agilent Technologies, Inc.
(Palo Alto, CA). The microarrays were fabricated by Agilent using their technology for
the in-situ synthesis of 60mer oligonucleotides (Hughes, et al. 2001, Nature Biotechnology
19:342-347). The 60mer microarray probes were designed by Agilent, from gene
sequences provided by diaDexus, using Agilent proprietary algorithms. Whenever
possible two different 60mers were designed for each gene of interest.

All microarray experiments were two-color experiments and were performed using
Agilent-recommended protocols and reagents. Briefly, each microarray was hybridized
with cRNAs synthesized from RNA (total RNA for ovarian and prostate, polyA+ RNA for
lung, breast and colon samples), isolated from cancer and normal tissues, labeled with
fluorescent dyes Cyanine3 (Cy3) or Cyanine5 (Cy5) (NEN Life Science Products, Inc.,
Boston, MA) using a linear amplification method (Agilent). In each experiment the
experimental sample was RNA isolated from cancer tissue from a single individual and
the reference sample was a pool of RNA isolated from normal tissues of the same organ as
the cancerous tissue (e.g., normal ovarian tissue in experiments with ovarian cancer
samples). Hybridizations were carried out at 60°C, overnight using Agilent in-situ
hybridization buffer. Following washing, arrays were scanned with a GenePix 4000B
Microarray Scanner (Axon Instruments, Inc., Union City, CA). The resulting images were
analyzed with GenePix Pro 3.0 Microarray Acquisition and Analysis Software (Axon).
Data normalization and expression profiling were done with Expressionist
software from GeneData Inc. (Daly City, CA/Basel, Switzerland). Gene expression
analysis was performed using only experiments that met certain quality criteria. The
quality criteria that experiments must meet are a combination of evaluations performed by
the Expressionist software and evaluations performed manually using raw and normalized
data. To evaluate raw data quality, detection limits (the mean signal for a replicated
negative control + 2 Standard Deviations (SD)) for each channel were calculated. The
detection limit is a measure of non-specific hybridization. Acceptable detection limits
were defined for each dye (<80 for Cy5 and <150 for Cy3). Arrays with poor detection
limits in one or both channels were not analyzed and the experiments were repeated. To
evaluate normalized data quality, positive control elements included in the array were
utilized. These array features should have a mean ratio of 1 (no differential expression).
If these features have a mean ratio of greater than 1.5-fold up or down, the experiments
were not analyzed further and were repeated. In addition to traditional scatter plots
demonstrating the distribution of signal in each experiment, the Expressionist software
also has minimum thresholding criteria that employ user defined parameters to identify
quality data. These thresholds include two distinct quality measurements: 1) minimum
area percentage, which is a measure of the integrity of each spot and 2) signal to noise
ratio, which ensures that the signal being measured is significantly above any background
(nonspecific) signal present. Only those features that met the threshold criteria were
included in the filtering and analyses carried out by Expressionist. The thresholding
settings employed require a minimum area percentage of 60% [(% pixels > background +
2SD)-(% pixels saturated)], and a minimum signal to noise ratio of 2.0 in both channels.
By these criteria, very low expressors, saturated features and spots with abnormally high
local background were not included in analysis.
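
By way of illustration only, the array-level and feature-level quality criteria described above may be sketched as follows. This is not the Expressionist software; the function names and data layout are hypothetical, and the use of the sample standard deviation is an assumption, since the kind of SD is not specified above.

```python
from statistics import mean, stdev

# Acceptable detection limits per dye, as stated in the text
DETECTION_LIMIT_MAX = {"Cy5": 80.0, "Cy3": 150.0}


def detection_limit(negative_control_signals):
    """Mean signal of the replicated negative control + 2 SD (sample SD assumed)."""
    return mean(negative_control_signals) + 2 * stdev(negative_control_signals)


def array_passes_raw_qc(neg_controls_by_channel):
    """An array is analyzed only if both channels are under their limits."""
    return all(
        detection_limit(neg_controls_by_channel[dye]) < limit
        for dye, limit in DETECTION_LIMIT_MAX.items()
    )


def controls_pass_normalized_qc(mean_ratio, max_fold=1.5):
    """Positive controls should stay within 1.5-fold of a ratio of 1, up or down."""
    return 1.0 / max_fold <= mean_ratio <= max_fold


def feature_passes(pct_above_bg_2sd, pct_saturated, snr_cy3, snr_cy5,
                   min_area_pct=60.0, min_snr=2.0):
    """Area percentage = (% pixels > background + 2SD) - (% pixels saturated);
    signal to noise ratio must reach the minimum in both channels."""
    area_pct = pct_above_bg_2sd - pct_saturated
    return area_pct >= min_area_pct and snr_cy3 >= min_snr and snr_cy5 >= min_snr


# Hypothetical negative-control readings for one array
controls = {"Cy5": [10, 12, 14], "Cy3": [100, 120, 140]}
print(detection_limit(controls["Cy5"]))  # 16.0
print(array_passes_raw_qc(controls))     # False: the Cy3 limit (160.0) is not below 150
```

Under these assumptions an array is repeated when either dye exceeds its detection limit, and a feature is excluded when its area percentage or its signal to noise ratio in either channel falls below threshold.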

Relative expression data was collected from Expressionist based on filtering and
clustering analyses. Up-regulated genes were identified using criteria for the percentage
of experiments in which the gene is up-regulated by at least 2-fold. In general,
up-regulation in ~30% of samples tested was used as a cutoff for filtering.

Two microarray experiments were performed for each normal and cancer tissue
pair. The tissue specific Array Chip for each cancer tissue is a unique microarray specific
to that tissue and cancer. The Multi-Cancer Array Chip is a universal microarray that was
hybridized with samples from each of the cancers (ovarian, breast, colon, lung, and
prostate). See the description below for the experiments specific to the different cancers.

Microarray Experiments and Data Tables 
COLON CANCER CHIPS 

For colon cancer two different chip designs were evaluated with overlapping sets
of a total of 38 samples, comparing the expression patterns of colon cancer derived
polyA+ RNA to polyA+ RNA isolated from a pool of 7 normal colon tissues. For the
Colon Array Chip all 38 samples (23 Ascending colon carcinomas and 15 Rectosigmoidal
carcinomas including: 5 stage I cancers, 15 stage II cancers, 15 stage III and 2 stage IV
cancers, as well as 28 Grade 1/2 and 10 Grade 3 cancers) were analyzed. The
histopathologic grades for cancer are classified as follows: GX, cannot be assessed; G1,
well differentiated; G2, moderately differentiated; G3, poorly differentiated; and G4,
undifferentiated. AJCC Cancer Staging Handbook, 5th Edition, 1998, page 9. For the
Colon Array Chip analysis, samples were further divided into groups based on the
expression pattern of the known colon cancer associated gene Thymidylate Synthase (TS)
(13 TS up, 25 TS not up). The association of TS with advanced colorectal cancer is well
documented. Paradiso et al., Br J Cancer 82(3):560-7 (2000); Etienne et al., J Clin
Oncol. 20(12):2832-43 (2002); Aschele et al., Clin Cancer Res. 6(12):4797-802 (2000).
For the Multi-Cancer Array Chip a subset of 27 of these samples (14 Ascending colon
carcinomas and 13 Rectosigmoidal carcinomas including: 3 stage I cancers, 9 stage II
cancers, 13 stage III and 2 stage IV cancers) were assessed.

The results for the statistically significant up-regulated genes on the Colon Array
Chip are shown in Tables 1 and 2. The results for the statistically significant up-regulated
genes on the Multi-Cancer Array Chip are shown in Table 3.

The first two columns of each table contain information about the sequence itself
(Seq ID, Oligo Name), the next columns show the results obtained for all ("ALL") the
colon samples, ascending colon carcinomas ("ASC"), Rectosigmoidal carcinomas ("RS"),
cancers corresponding to stages I and II ("ST1,2"), stages III and IV ("ST3,4"), grades 1
and 2 ("GR1,2"), grade 3 ("GR3"), cancers exhibiting up-regulation of the TS gene
("TSup") or those not exhibiting up-regulation of the TS gene ("NOT TSup"). '%up'
indicates the percentage of all experiments in which up-regulation of at least 2-fold was
observed (n=38 for the Colon Array Chip, n=27 for the Multi-Cancer Array Chip); '%valid
up' indicates the percentage of experiments with valid expression values in which
up-regulation of at least 2-fold was observed.
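
By way of illustration only, the difference between '%up' and '%valid up' may be sketched as follows. The function name and the convention of marking an experiment without a valid expression value as None are assumptions; the fold-change values in the usage example are hypothetical.

```python
def up_regulation_summary(fold_changes, min_fold=2.0):
    """Return ('%up', '%valid up') for one oligo, rounded to one decimal.

    fold_changes: one entry per experiment (cancer / normal ratio), with
    None for experiments lacking a valid expression value (assumed convention).
    """
    n_all = len(fold_changes)
    valid = [fc for fc in fold_changes if fc is not None]
    n_up = sum(1 for fc in valid if fc >= min_fold)
    # '%up' divides by all experiments; '%valid up' only by the valid ones
    pct_up = 100.0 * n_up / n_all if n_all else 0.0
    pct_valid_up = 100.0 * n_up / len(valid) if valid else 0.0
    return round(pct_up, 1), round(pct_valid_up, 1)


# Hypothetical oligo: 38 experiments, 13 with valid values, 11 of those up >= 2-fold
ratios = [2.5] * 11 + [1.1] * 2 + [None] * 25
print(up_regulation_summary(ratios))  # (28.9, 84.6)
```

When every experiment yields a valid value the two percentages coincide; a large gap between them indicates that many experiments were excluded by the quality thresholds for that oligo.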



Table 1. 



DEX ID | Oligo Name | Cln ALL %up n=38 | Cln ALL %valid up n=38 | Cln ASC %up n=23 | Cln ASC %valid up n=23 | Cln RS %up n=15 | Cln RS %valid up n=15 | Cln ST1,2 %up n=20 | Cln ST1,2 %valid up n=20 | Cln ST3,4 %up n=18 | Cln ST3,4 %valid up n=18


DEX0450_002.nt.1 | 31003.0 | 5.3 | 5.9 | 4.3 | 4.8 | 6.7 | 7.7 | 0.0 | 0.0 | 11.1 | 11.8
DEX0450_002.nt.1 | 31158.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_002.nt.1 | 31159.0 | 10.5 | 10.5 | 13.0 | 13.0 | 6.7 | 6.7 | 5.0 | 5.0 | 16.7 | 16.7
DEX0450_002.nt.1 | 31162.0 | 15.8 | 16.2 | 26.1 | 26.1 | 0.0 | 0.0 | 15.0 | 15.8 | 16.7 | 16.7
DEX0450_002.nt.1 | 31163.0 | 7.9 | 7.9 | 13.0 | 13.0 | 0.0 | 0.0 | 5.0 | 5.0 | 11.1 | 11.1
DEX0450_002.nt.1 | 34074.0 | 13.2 | 14.7 | 17.4 | 19.0 | 6.7 | 7.7 | 5.0 | 5.9 | 22.2 | 23.5
DEX0450_002.nt.1 | 34075.0 | 2.6 | 2.7 | 0.0 | 0.0 | 6.7 | 6.7 | 0.0 | 0.0 | 5.6 | 5.9
DEX0450_003.nt.1 | 31446.0 | 2.6 | 3.1 | 0.0 | 0.0 | 6.7 | 7.1 | 0.0 | 0.0 | 5.6 | 6.7
DEX0450_003.nt.2 | 31447.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_003.nt.3 | 31443.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_003.nt.3 | 31447.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_004.nt.1 | 10720.0 | 36.8 | 36.8 | 21.7 | 21.7 | 60.0 | 60.0 | 30.0 | 30.0 | 44.4 | 44.4
DEX0450_004.nt.1 | 10721.0 | 36.8 | 36.8 | 21.7 | 21.7 | 60.0 | 60.0 | 30.0 | 30.0 | 44.4 | 44.4
DEX0450_005.nt.1 | 16776.0 | 2.6 | 3.1 | 4.3 | 5.0 | 0.0 | 0.0 | 0.0 | 0.0 | 5.6 | 7.1
DEX0450_005.nt.1 | 16777.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_005.nt.1 | 38049.0 | 47.4 | 47.4 | 43.5 | 43.5 | 53.3 | 53.3 | 50.0 | 50.0 | 44.4 | 44.4
DEX0450_005.nt.1 | 38050.0 | 55.3 | 55.3 | 47.8 | 47.8 | 66.7 | 66.7 | 60.0 | 60.0 | 50.0 | 50.0
DEX0450_006.nt.1 | 35170.0 | 28.9 | 84.6 | 21.7 | 71.4 | 40.0 | 100.0 | 30.0 | 85.7 | 27.8 | 83.3
DEX0450_006.nt.1 | 35171.0 | 21.1 | 40.0 | 21.7 | 35.7 | 20.0 | 50.0 | 25.0 | 45.5 | 16.7 | 33.3
DEX0450_007.nt.1 | 35170.0 | 28.9 | 84.6 | 21.7 | 71.4 | 40.0 | 100.0 | 30.0 | 85.7 | 27.8 | 83.3
DEX0450_007.nt.1 | 35171.0 | 21.1 | 40.0 | 21.7 | 35.7 | 20.0 | 50.0 | 25.0 | 45.5 | 16.7 | 33.3
DEX0450_008.nt.1 | 30227.0 | 50.0 | 50.0 | 60.9 | 60.9 | 33.3 | 33.3 | 55.0 | 55.0 | 44.4 | 44.4
DEX0450_008.nt.1 | 30228.0 | 44.7 | 44.7 | 56.5 | 56.5 | 26.7 | 26.7 | 45.0 | 45.0 | 44.4 | 44.4
DEX0450_009.nt.1 | 39839.0 | 10.5 | 10.5 | 13.0 | 13.0 | 6.7 | 6.7 | 0.0 | 0.0 | 22.2 | 22.2
DEX0450_009.nt.1 | 39840.0 | 10.5 | 10.8 | 13.0 | 13.6 | 6.7 | 6.7 | 0.0 | 0.0 | 22.2 | 22.2
DEX0450_009.nt.2 | 39839.0 | 10.5 | 10.5 | 13.0 | 13.0 | 6.7 | 6.7 | 0.0 | 0.0 | 22.2 | 22.2
DEX0450_009.nt.2 | 39840.0 | 10.5 | 10.8 | 13.0 | 13.6 | 6.7 | 6.7 | 0.0 | 0.0 | 22.2 | 22.2
DEX0450_010.nt.1 | 29571.0 | 31.6 | 31.6 | 43.5 | 43.5 | 13.3 | 13.3 | 25.0 | 25.0 | 38.9 | 38.9
DEX0450_010.nt.1 | 29582.0 | 52.6 | 52.6 | 60.9 | 60.9 | 40.0 | 40.0 | 45.0 | 45.0 | 61.1 | 61.1
DEX0450_010.nt.1 | 29595.0 | 52.6 | 52.6 | 56.5 | 56.5 | 46.7 | 46.7 | 45.0 | 45.0 | 61.1 | 61.1
DEX0450_010.nt.1 | 29609.0 | 47.4 | 47.4 | 52.2 | 52.2 | 40.0 | 40.0 | 40.0 | 40.0 | 55.6 | 55.6
DEX0450_010.nt.1 | 29611.0 | 52.6 | 52.6 | 56.5 | 56.5 | 46.7 | 46.7 | 45.0 | 45.0 | 61.1 | 61.1
DEX0450_010.nt.1 | 29612.0 | 50.0 | 50.0 | 52.2 | 52.2 | 46.7 | 46.7 | 45.0 | 45.0 | 55.6 | 55.6
DEX0450_010.nt.2 | 29582.0 | 52.6 | 52.6 | 60.9 | 60.9 | 40.0 | 40.0 | 45.0 | 45.0 | 61.1 | 61.1
DEX0450_010.nt.2 | 29595.0 | 52.6 | 52.6 | 56.5 | 56.5 | 46.7 | 46.7 | 45.0 | 45.0 | 61.1 | 61.1
DEX0450_010.nt.2 | 29609.0 | 47.4 | 47.4 | 52.2 | 52.2 | 40.0 | 40.0 | 40.0 | 40.0 | 55.6 | 55.6
DEX0450_010.nt.2 | 29611.0 | 52.6 | 52.6 | 56.5 | 56.5 | 46.7 | 46.7 | 45.0 | 45.0 | 61.1 | 61.1
DEX0450_010.nt.2 | 29612.0 | 50.0 | 50.0 | 52.2 | 52.2 | 46.7 | 46.7 | 45.0 | 45.0 | 55.6 | 55.6
DEX0450_010.nt.3 | 29571.0 | 31.6 | 31.6 | 43.5 | 43.5 | 13.3 | 13.3 | 25.0 | 25.0 | 38.9 | 38.9
DEX0450_010.nt.3 | 29572.0 | 5.3 | 5.3 | 4.3 | 4.3 | 6.7 | 6.7 | 10.0 | 10.0 | 0.0 | 0.0
DEX0450_010.nt.3 | 29582.0 | 52.6 | 52.6 | 60.9 | 60.9 | 40.0 | 40.0 | 45.0 | 45.0 | 61.1 | 61.1
DEX0450_010.nt.3 | 29595.0 | 52.6 | 52.6 | 56.5 | 56.5 | 46.7 | 46.7 | 45.0 | 45.0 | 61.1 | 61.1
DEX0450_010.nt.3 | 29609.0 | 47.4 | 47.4 | 52.2 | 52.2 | 40.0 | 40.0 | 40.0 | 40.0 | 55.6 | 55.6
DEX0450_010.nt.3 | 29611.0 | 52.6 | 52.6 | 56.5 | 56.5 | 46.7 | 46.7 | 45.0 | 45.0 | 61.1 | 61.1
DEX0450_010.nt.3 | 29612.0 | 50.0 | 50.0 | 52.2 | 52.2 | 46.7 | 46.7 | 45.0 | 45.0 | 55.6 | 55.6
DEX0450_010.nt.3 | 36747.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_011.nt.1 | 22654.0 | 10.5 | 10.5 | 17.4 | 17.4 | 0.0 | 0.0 | 10.0 | 10.0 | 11.1 | 11.1
DEX0450_012.nt.1 | 8377.0 | 13.2 | 13.2 | 21.7 | 21.7 | 0.0 | 0.0 | 15.0 | 15.0 | 11.1 | 11.1
DEX0450_013.nt.1 | 32220.0 | 47.4 | 47.4 | 43.5 | 43.5 | 53.3 | 53.3 | 30.0 | 30.0 | 66.7 | 66.7
DEX0450_013.nt.1 | 32221.0 | 60.5 | 63.9 | 52.2 | 54.5 | 73.3 | 78.6 | 50.0 | 52.6 | 72.2 | 76.5
DEX0450_013.nt.1 | 32254.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_013.nt.1 | 32255.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_013.nt.1 | 33033.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_014.nt.1 | 32458.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_014.nt.1 | 32459.0 | 5.3 | 5.4 | 8.7 | 8.7 | 0.0 | 0.0 | 0.0 | 0.0 | 11.1 | 11.1
DEX0450_015.nt.1 | 33503.0 | 18.4 | 18.9 | 30.4 | 31.8 | 0.0 | 0.0 | 15.0 | 15.8 | 22.2 | 22.2
DEX0450_016.nt.1 | 33083.0 | 5.3 | 5.3 | 4.3 | 4.3 | 6.7 | 6.7 | 10.0 | 10.0 | 0.0 | 0.0
DEX0450_016.nt.1 | 35091.0 | 21.1 | 21.1 | 30.4 | 30.4 | 6.7 | 6.7 | 20.0 | 20.0 | 22.2 | 22.2
DEX0450_016.nt.2 | 33083.0 | 5.3 | 5.3 | 4.3 | 4.3 | 6.7 | 6.7 | 10.0 | 10.0 | 0.0 | 0.0
DEX0450_016.nt.2 | 35091.0 | 21.1 | 21.1 | 30.4 | 30.4 | 6.7 | 6.7 | 20.0 | 20.0 | 22.2 | 22.2
DEX0450_016.nt.3 | 33083.0 | 5.3 | 5.3 | 4.3 | 4.3 | 6.7 | 6.7 | 10.0 | 10.0 | 0.0 | 0.0
DEX0450_016.nt.3 | 35091.0 | 21.1 | 21.1 | 30.4 | 30.4 | 6.7 | 6.7 | 20.0 | 20.0 | 22.2 | 22.2
DEX0450_017.nt.1 | 36711.0 | 2.6 | 2.6 | 0.0 | 0.0 | 6.7 | 6.7 | 0.0 | 0.0 | 5.6 | 5.6
DEX0450_017.nt.1 | 36712.0 | 2.6 | 2.8 | 0.0 | 0.0 | 6.7 | 7.1 | 0.0 | 0.0 | 5.6 | 5.9
DEX0450_017.nt.1 | 39631.0 | 2.6 | 2.8 | 0.0 | 0.0 | 6.7 | 7.1 | 0.0 | 0.0 | 5.6 | 5.9
DEX0450_017.nt.1 | 39635.0 | 5.3 | 5.3 | 4.3 | 4.3 | 6.7 | 6.7 | 0.0 | 0.0 | 11.1 | 11.1
DEX0450_017.nt.1 | 39636.0 | 2.6 | 2.6 | 0.0 | 0.0 | 6.7 | 6.7 | 0.0 | 0.0 | 5.6 | 5.6
DEX0450_017.nt.1 | 39769.0 | 13.2 | 13.2 | 17.4 | 17.4 | 6.7 | 6.7 | 5.0 | 5.0 | 22.2 | 22.2
DEX0450_017.nt.1 | 39770.0 | 5.3 | 5.3 | 4.3 | 4.3 | 6.7 | 6.7 | 0.0 | 0.0 | 11.1 | 11.1
DEX0450_017.nt.2 | 36711.0 | 2.6 | 2.6 | 0.0 | 0.0 | 6.7 | 6.7 | 0.0 | 0.0 | 5.6 | 5.6
DEX0450_017.nt.2 | 39631.0 | 2.6 | 2.8 | 0.0 | 0.0 | 6.7 | 7.1 | 0.0 | 0.0 | 5.6 | 5.9
DEX0450_017.nt.2 | 39635.0 | 5.3 | 5.3 | 4.3 | 4.3 | 6.7 | 6.7 | 0.0 | 0.0 | 11.1 | 11.1
DEX0450_017.nt.2 | 39636.0 | 2.6 | 2.6 | 0.0 | 0.0 | 6.7 | 6.7 | 0.0 | 0.0 | 5.6 | 5.6
DEX0450_017.nt.2 | 39769.0 | 13.2 | 13.2 | 17.4 | 17.4 | 6.7 | 6.7 | 5.0 | 5.0 | 22.2 | 22.2
DEX0450_017.nt.2 | 39770.0 | 5.3 | 5.3 | 4.3 | 4.3 | 6.7 | 6.7 | 0.0 | 0.0 | 11.1 | 11.1
DEX0450_017.nt.3 | 39631.0 | 2.6 | 2.8 | 0.0 | 0.0 | 6.7 | 7.1 | 0.0 | 0.0 | 5.6 | 5.9
DEX0450_017.nt.3 | 39635.0 | 5.3 | 5.3 | 4.3 | 4.3 | 6.7 | 6.7 | 0.0 | 0.0 | 11.1 | 11.1
DEX0450_017.nt.3 | 39636.0 | 2.6 | 2.6 | 0.0 | 0.0 | 6.7 | 6.7 | 0.0 | 0.0 | 5.6 | 5.6
DEX0450_017.nt.3 | 39769.0 | 13.2 | 13.2 | 17.4 | 17.4 | 6.7 | 6.7 | 5.0 | 5.0 | 22.2 | 22.2
DEX0450_018.nt.1 | 31424.0 | 10.5 | 11.8 | 13.0 | 13.6 | 6.7 | 8.3 | 10.0 | 11.1 | 11.1 | 12.5
DEX0450_018.nt.1 | 31425.0 | 10.5 | 12.1 | 13.0 | 14.3 | 6.7 | 8.3 | 5.0 | 5.3 | 16.7 | 21.4
DEX0450_018.nt.2 | 31424.0 | 10.5 | 11.8 | 13.0 | 13.6 | 6.7 | 8.3 | 10.0 | 11.1 | 11.1 | 12.5
DEX0450_018.nt.2 | 31425.0 | 10.5 | 12.1 | 13.0 | 14.3 | 6.7 | 8.3 | 5.0 | 5.3 | 16.7 | 21.4
DEX0450_018.nt.3 | 31424.0 | 10.5 | 11.8 | 13.0 | 13.6 | 6.7 | 8.3 | 10.0 | 11.1 | 11.1 | 12.5
DEX0450_018.nt.3 | 31425.0 | 10.5 | 12.1 | 13.0 | 14.3 | 6.7 | 8.3 | 5.0 | 5.3 | 16.7 | 21.4
DEX0450_019.nt.1 | 32748.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_019.nt.1 | 33066.0 | 10.5 | 10.5 | 17.4 | 17.4 | 0.0 | 0.0 | 20.0 | 20.0 | [row truncated in source]
[row start missing in source] | 6.7 | 10.0 | 10.0 | 16.7 | 16.7
DEX0450_041.nt.1 | 36755.0 | 5.3 | 5.9 | 4.3 | 4.5 | 6.7 | 8.3 | 0.0 | 0.0 | 11.1 | 13.3
DEX0450_041.nt.1 | 36756.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_041.nt.1 | 37290.0 | 5.3 | 5.3 | 8.7 | 8.7 | 0.0 | 0.0 | 5.0 | 5.0 | 5.6 | 5.6
DEX0450_042.nt.1 | 33118.0 | 7.9 | 7.9 | 13.0 | 13.0 | 0.0 | 0.0 | 10.0 | 10.0 | 5.6 | 5.6
DEX0450_042.nt.1 | 33119.0 | 5.3 | 5.3 | 8.7 | 8.7 | 0.0 | 0.0 | 5.0 | 5.0 | 5.6 | 5.6
DEX0450_042.nt.1 | 34009.0 | 26.3 | 27.0 | 26.1 | 27.3 | 26.7 | 26.7 | 25.0 | 25.0 | 27.8 | 29.4
DEX0450_042.nt.1 | 40737.0 | 13.2 | 13.2 | 21.7 | 21.7 | 0.0 | 0.0 | 10.0 | 10.0 | 16.7 | 16.7
DEX0450_042.nt.2 | 33118.0 | 7.9 | 7.9 | 13.0 | 13.0 | 0.0 | 0.0 | 10.0 | 10.0 | 5.6 | 5.6
DEX0450_042.nt.2 | 33119.0 | 5.3 | 5.3 | 8.7 | 8.7 | 0.0 | 0.0 | 5.0 | 5.0 | 5.6 | 5.6
DEX0450_042.nt.2 | 34009.0 | 26.3 | 27.0 | 26.1 | 27.3 | 26.7 | 26.7 | 25.0 | 25.0 | 27.8 | 29.4
DEX0450_042.nt.2 | 40737.0 | 13.2 | 13.2 | 21.7 | 21.7 | 0.0 | 0.0 | 10.0 | 10.0 | 16.7 | 16.7
DEX0450_043.nt.1 | 35881.0 | 2.6 | 3.0 | 0.0 | 0.0 | 6.7 | 7.1 | 0.0 | 0.0 | 5.6 | 6.2
DEX0450_043.nt.1 | 35882.0 | 2.6 | 14.3 | 4.3 | 20.0 | 0.0 | 0.0 | 5.0 | 25.0 | 0.0 | 0.0
DEX0450_043.nt.1 | 36669.0 | 28.9 | 28.9 | 30.4 | 30.4 | 26.7 | 26.7 | 25.0 | 25.0 | 33.3 | 33.3
DEX0450_043.nt.1 | 36670.0 | 28.9 | 28.9 | 30.4 | 30.4 | 26.7 | 26.7 | 25.0 | 25.0 | 33.3 | 33.3
DEX0450_044.nt.1 | 9110.0 | 28.9 | 28.9 | 30.4 | 30.4 | 26.7 | 26.7 | 30.0 | 30.0 | 27.8 | 27.8
DEX0450_045.nt.1 | 13783.0 | 15.8 | 15.8 | 21.7 | 21.7 | 6.7 | 6.7 | 20.0 | 20.0 | 11.1 | 11.1
DEX0450_045.nt.1 | 13784.0 | 13.2 | 13.2 | 17.4 | 17.4 | 6.7 | 6.7 | 15.0 | 15.0 | 11.1 | 11.1
DEX0450_046.nt.1 | 10297.0 | 63.2 | 63.2 | 73.9 | 73.9 | 46.7 | 46.7 | 65.0 | 65.0 | 61.1 | 61.1
DEX0450_047.nt.1 | [oligo name and values illegible in source]
DEX0450_047.nt.1 | 38789.0 | 7.9 | 7.9 | 4.3 | 4.3 | 13.3 | 13.3 | 5.0 | 5.0 | 11.1 | 11.1
DEX0450_047.nt.1 | 39412.0 | 7.9 | 7.9 | 4.3 | 4.3 | 13.3 | 13.3 | 5.0 | 5.0 | 11.1 | 11.1
DEX0450_048.nt.1 | 41177.0 | 7.9 | 7.9 | 13.0 | 13.0 | 0.0 | 0.0 | 5.0 | 5.0 | 11.1 | 11.1
DEX0450_048.nt.1 | 41178.0 | 10.5 | 10.8 | 17.4 | 17.4 | 0.0 | 0.0 | 10.0 | 10.0 | 11.1 | 11.8
DEX0450_049.nt.1 | 31424.0 | 10.5 | 11.8 | 13.0 | 13.6 | 6.7 | 8.3 | 10.0 | 11.1 | 11.1 | 12.5
DEX0450_049.nt.1 | 31425.0 | 10.5 | 12.1 | 13.0 | 14.3 | 6.7 | 8.3 | 5.0 | 5.3 | 16.7 | 21.4
DEX0450_050.nt.1 | 9398.0 | 26.3 | 26.3 | 30.4 | 30.4 | 20.0 | 20.0 | 30.0 | 30.0 | 22.2 | 22.2
DEX0450_050.nt.1 | 9399.0 | 26.3 | 26.3 | 30.4 | 30.4 | 20.0 | 20.0 | 30.0 | 30.0 | 22.2 | 22.2
DEX0450_052.nt.1 | [oligo name and values illegible in source]
DEX0450_052.nt.1 | 37944.0 | 50.0 | 51.4 | 56.5 | 56.5 | 40.0 | 42.9 | 50.0 | 52.6 | 50.0 | 50.0
DEX0450_053.nt.1 | 9398.0 | 26.3 | 26.3 | 30.4 | 30.4 | 20.0 | 20.0 | 30.0 | 30.0 | 22.2 | 22.2
DEX0450_053.nt.1 | 9399.0 | 26.3 | 26.3 | 30.4 | 30.4 | 20.0 | 20.0 | 30.0 | 30.0 | 22.2 | 22.2



Table 2. 



DEX ID | Oligo Name | Cln GR1,2 %up n=28 | Cln GR1,2 %valid up n=28 | Cln GR3 %up n=10 | Cln GR3 %valid up n=10 | Cln TS up %up n=13 | Cln TS up %valid up n=13 | Cln NOT TS up %up n=25 | Cln NOT TS up %valid up n=25


DEX0450_002.nt.1 | 31003.0 | 0.0 | 0.0 | 20.0 | 22.2 | 15.4 | 15.4 | 0.0 | 0.0
DEX0450_002.nt.1 | 31158.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | [illegible] | 0.0
DEX0450_002.nt.1 | 31159.0 | 0.0 | 0.0 | 40.0 | 40.0 | 23.1 | [illegible] | [illegible] | 4.0
[illegible] | 31162.0 | 14.3 | 14.8 | 20.0 | 20.0 | 23.1 | 23.1 | 12.0 | 12.5
DEX0450_002.nt.1 | 31163.0 | 3.6 | 3.6 | 20.0 | 20.0 | 15.4 | 15.4 | 4.0 | 4.0
DEX0450_002.nt.1 | 34074.0 | 3.6 | 4.0 | 40.0 | 44.4 | 30.8 | 33.3 | 4.0 | 4.5
DEX0450_002.nt.1 | 34075.0 | 0.0 | 0.0 | 10.0 | 11.1 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_003.nt.1 | 31446.0 | 0.0 | 0.0 | 10.0 | 14.3 | 7.7 | 11.1 | 0.0 | 0.0
DEX0450_003.nt.2 | 31447.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_003.nt.3 | 31443.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_003.nt.3 | 31447.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_004.nt.1 | 10720.0 | 42.9 | 42.9 | 20.0 | 20.0 | 7.7 | 7.7 | 52.0 | 52.0
DEX0450_004.nt.1 | 10721.0 | 46.4 | 46.4 | 10.0 | 10.0 | 15.4 | 15.4 | 48.0 | 48.0
DEX0450_005.nt.1 | 16776.0 | 3.6 | 4.0 | 0.0 | 0.0 | 7.7 | 8.3 | 0.0 | 0.0
DEX0450_005.nt.1 | 16777.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_005.nt.1 | 38049.0 | 50.0 | 50.0 | 40.0 | 40.0 | 61.5 | 61.5 | 40.0 | 40.0
DEX0450_005.nt.1 | 38050.0 | 60.7 | 60.7 | 40.0 | 40.0 | 61.5 | 61.5 | 52.0 | 52.0
DEX0450_006.nt.1 | 35170.0 | 35.7 | 83.3 | 10.0 | 100.0 | 7.7 | 33.3 | 40.0 | 100.0
DEX0450_006.nt.1 | 35171.0 | 28.6 | 47.1 | 0.0 | 0.0 | 7.7 | 16.7 | 28.0 | 50.0
DEX0450_007.nt.1 | 35170.0 | 35.7 | 83.3 | 10.0 | 100.0 | 7.7 | 33.3 | 40.0 | 100.0
DEX0450_007.nt.1 | 35171.0 | 28.6 | 47.1 | 0.0 | 0.0 | 7.7 | 16.7 | 28.0 | 50.0
DEX0450_008.nt.1 | 30227.0 | 42.9 | 42.9 | 70.0 | 70.0 | 61.5 | 61.5 | 44.0 | 44.0
DEX0450_008.nt.1 | 30228.0 | 35.7 | 35.7 | 70.0 | 70.0 | 53.8 | 53.8 | 40.0 | 40.0
DEX0450_009.nt.1 | 39839.0 | 3.6 | 3.6 | 30.0 | 30.0 | 23.1 | 23.1 | 4.0 | 4.0
DEX0450_009.nt.1 | 39840.0 | 3.6 | 3.7 | 30.0 | 30.0 | 23.1 | 23.1 | 4.0 | 4.2
DEX0450_009.nt.2 | 39839.0 | 3.6 | 3.6 | 30.0 | 30.0 | 23.1 | 23.1 | 4.0 | 4.0
DEX0450_009.nt.2 | 39840.0 | 3.6 | 3.7 | 30.0 | 30.0 | 23.1 | 23.1 | 4.0 | 4.2
DEX0450_010.nt.1 | 29571.0 | 28.6 | 28.6 | 40.0 | 40.0 | 46.2 | 46.2 | 24.0 | 24.0
DEX0450_010.nt.1 | 29582.0 | 46.4 | 46.4 | 70.0 | 70.0 | 61.5 | 61.5 | 48.0 | 48.0
DEX0450_010.nt.1 | 29595.0 | 50.0 | 50.0 | 60.0 | 60.0 | 69.2 | 69.2 | 44.0 | 44.0
DEX0450_010.nt.1 | 29609.0 | 46.4 | 46.4 | 50.0 | 50.0 | 61.5 | 61.5 | 40.0 | 40.0
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DEX0450 OlO.nt.l 


29611.0 


50.0 


50.0 


60.0 


60.0 


69.2 


69.2 


44 .0 


44.0 


DEX0450 OlO.nt.l 


29612.0 


50.0 


50.0 


50.0 


50.0 


69.2 


69.2 


40.0 


40.0 


DEX0450 010. nt. 2 


29582.0 


46.4 


46.4 


70.0 


70.0 


61.5 


61.5 


48.0 


48.0 


DEX0450 010. nt. 2 


29595 .0 


50.0 


50.0 


60.0 


60.0 


69.2 


69.2 


44 .0 


44.0 


DEX0450 010. nt. 2 


29609.0 


46.4 


46.4 


50.0 


50.0 


61.5 


61.5 


40.0 


40.0 


DEX0450 010. nt. 2 


29611.0 


50.0 


50.0 


60.0 


60.0 


69.2 


69.2 


44.0 


44.0 


DEX0450 010. nt. 2 


29612.0 


50.0 


50.0 


50.0 


50.0 


69.2 


69.2 


40.0 


40.0 


DEX0450 010. nt. 3 


29571.0 


28.6 


28.6 


40.0 


40.0 


46.2 


46.2 


24.0 


24.0 


DEX0450 010. nt. 3 


29572 .0 


7.1 


7.1 


0.0 


0.0 


7.7 


7.7 


4.0 


4.0 


DEX0450 010. nt. 3 


29582.0 


46.4 


46.4 


70. 0 


70.0 


61.5 


61.5 


48.0 


48.0 


DEX0450 010. nt. 3 


29595.0 


50.0 


50.0 


60.0 


60.0 


69.2 


69.2 


44 .0 


44 .0 


DEX0450 010. nt. 3 


29609.0 


46.4 


46.4 


50. 0 


50.0 


61.5 


61.5 


40.0 


40.0 


DEX0450_010.nt.3 | 29611.0 | 50.0 | 50.0 | 60.0 | 60.0 | 69.2 | 69.2 | 44.0 | 44.0
DEX0450_010.nt.3 | 29612.0 | 50.0 | 50.0 | 50.0 | 50.0 | 69.2 | 69.2 | 40.0 | 40.0
DEX0450_010.nt.3 | 36747.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_011.nt.1 | 22654.0 | 3.6 | 3.6 | 30.0 | 30.0 | 15.4 | 15.4 | 8.0 | 8.0
DEX0450_012.nt.1 | 8377.0 | 10.7 | 10.7 | 20.0 | 20.0 | 30.8 | 30.8 | 4.0 | 4.0
DEX0450_013.nt.1 | 32220.0 | 42.9 | 42.9 | 60.0 | 60.0 | 53.8 | 53.8 | 44.0 | 44.0
DEX0450_013.nt.1 | 32221.0 | 57.1 | 59.3 | 70.0 | 77.8 | 84.6 | 84.6 | 48.0 | 52.2
DEX0450_013.nt.1 | 32254.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_013.nt.1 | 32255.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_013.nt.1 | 33033.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_014.nt.1 | 32458.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_014.nt.1 | 32459.0 | 3.6 | 3.7 | 10.0 | 10.0 | 7.7 | 7.7 | 4.0 | 4.2
DEX0450_015.nt.1 | 33503.0 | 14.3 | 14.8 | 30.0 | 30.0 | 38.5 | 38.5 | 8.0 | 8.3
DEX0450_016.nt.1 | 33083.0 | 7.1 | 7.1 | 0.0 | 0.0 | 0.0 | 0.0 | 8.0 | 8.0
DEX0450_016.nt.1 | 35091.0 | 28.6 | 28.6 | 0.0 | 0.0 | 38.5 | 38.5 | 12.0 | 12.0
DEX0450_016.nt.2 | 33083.0 | 7.1 | 7.1 | 0.0 | 0.0 | 0.0 | 0.0 | 8.0 | 8.0
DEX0450_016.nt.2 | 35091.0 | 28.6 | 28.6 | 0.0 | 0.0 | 38.5 | 38.5 | 12.0 | 12.0
DEX0450_016.nt.3 | 33083.0 | 7.1 | 7.1 | 0.0 | 0.0 | 0.0 | 0.0 | 8.0 | 8.0
DEX0450_016.nt.3 | 35091.0 | 28.6 | 28.6 | 0.0 | 0.0 | 38.5 | 38.5 | 12.0 | 12.0
DEX0450_017.nt.1 | 36711.0 | 0.0 | 0.0 | 10.0 | 10.0 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_017.nt.1 | 36712.0 | 0.0 | 0.0 | 10.0 | 10.0 | 7.7 | 8.3 | 0.0 | 0.0
DEX0450_017.nt.1 | 39631.0 | 0.0 | 0.0 | 10.0 | 10.0 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_017.nt.1 | 39635.0 | 0.0 | 0.0 | 20.0 | 20.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_017.nt.1 | 39636.0 | 0.0 | 0.0 | 10.0 | 10.0 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_017.nt.1 | 39769.0 | 7.1 | 7.1 | 30.0 | 30.0 | 23.1 | 23.1 | 8.0 | 8.0
DEX0450_017.nt.1 | 39770.0 | 0.0 | 0.0 | 20.0 | 20.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_017.nt.2 | 36711.0 | 0.0 | 0.0 | 10.0 | 10.0 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_017.nt.2 | 39631.0 | 0.0 | 0.0 | 10.0 | 10.0 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_017.nt.2 | 39635.0 | 0.0 | 0.0 | 20.0 | 20.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_017.nt.2 | 39636.0 | 0.0 | 0.0 | 10.0 | 10.0 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_017.nt.2 | 39769.0 | 7.1 | 7.1 | 30.0 | 30.0 | 23.1 | 23.1 | 8.0 | 8.0
DEX0450_017.nt.2 | 39770.0 | 0.0 | 0.0 | 20.0 | 20.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_017.nt.3 | 39631.0 | 0.0 | 0.0 | 10.0 | 10.0 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_017.nt.3 | 39635.0 | 0.0 | 0.0 | 20.0 | 20.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_017.nt.3 | 39636.0 | 0.0 | 0.0 | 10.0 | 10.0 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_017.nt.3 | 39769.0 | 7.1 | 7.1 | 30.0 | 30.0 | 23.1 | 23.1 | 8.0 | 8.0
DEX0450_018.nt.1 | 31424.0 | 7.1 | 8.0 | 20.0 | 22.2 | 23.1 | 23.1 | 4.0 | 4.8
DEX0450_018.nt.1 | 31425.0 | 3.6 | 4.0 | 30.0 | 37.5 | 30.8 | 30.8 | 0.0 | 0.0
DEX0450_018.nt.2 | 31424.0 | 7.1 | 8.0 | 20.0 | 22.2 | 23.1 | 23.1 | 4.0 | 4.8
DEX0450_018.nt.2 | 31425.0 | 3.6 | 4.0 | 30.0 | 37.5 | 30.8 | 30.8 | 0.0 | 0.0
DEX0450_018.nt.3 | 31424.0 | 7.1 | 8.0 | 20.0 | 22.2 | 23.1 | 23.1 | 4.0 | 4.8
DEX0450_018.nt.3 | 31425.0 | 3.6 | 4.0 | 30.0 | 37.5 | 30.8 | 30.8 | 0.0 | 0.0
DEX0450_019.nt.1 | 32748.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_019.nt.1 | 33066.0 | 10.7 | 10.7 | 10.0 | 10.0 | 15.4 | 15.4 | 8.0 | 8.0
DEX0450_019.nt.1 | 33067.0 | 14.3 | 14.3 | 20.0 | 20.0 | 30.8 | 30.8 | 8.0 | 8.0






DEX0450_019.nt.1 | 35570.0 | 14.3 | 14.8 | 20.0 | 20.0 | 30.8 | 30.8 | 8.0 | 8.3
DEX0450_019.nt.1 | 35571.0 | 14.3 | 14.3 | 30.0 | 30.0 | 30.8 | 30.8 | 12.0 | 12.0
DEX0450_019.nt.2 | 32748.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_019.nt.2 | 33066.0 | 10.7 | 10.7 | 10.0 | 10.0 | 15.4 | 15.4 | 8.0 | 8.0
DEX0450_019.nt.2 | 33067.0 | 14.3 | 14.3 | 20.0 | 20.0 | 30.8 | 30.8 | 8.0 | 8.0
DEX0450_019.nt.2 | 35570.0 | 14.3 | 14.8 | 20.0 | 20.0 | 30.8 | 30.8 | 8.0 | 8.3
DEX0450_019.nt.2 | 35571.0 | 14.3 | 14.3 | 30.0 | 30.0 | 30.8 | 30.8 | 12.0 | 12.0
DEX0450_019.nt.3 | 32748.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_019.nt.3 | 33066.0 | 10.7 | 10.7 | 10.0 | 10.0 | 15.4 | 15.4 | 8.0 | 8.0
DEX0450_019.nt.3 | 33067.0 | 14.3 | 14.3 | 20.0 | 20.0 | 30.8 | 30.8 | 8.0 | 8.0
DEX0450_019.nt.3 | 35570.0 | 14.3 | 14.8 | 20.0 | 20.0 | 30.8 | 30.8 | 8.0 | 8.3
DEX0450_019.nt.3 | 35571.0 | 14.3 | 14.3 | 30.0 | 30.0 | 30.8 | 30.8 | 12.0 | 12.0
DEX0450_020.nt.1 | 32972.0 | 0.0 | 0.0 | 30.0 | 30.0 | 15.4 | 15.4 | 4.0 | 4.0
DEX0450_020.nt.1 | 32990.0 | 0.0 | 0.0 | 20.0 | 20.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_020.nt.1 | 32991.0 | 0.0 | 0.0 | 20.0 | 20.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_020.nt.1 | 32992.0 | 3.6 | 3.6 | 20.0 | 20.0 | 7.7 | 7.7 | 8.0 | 8.0
DEX0450_020.nt.1 | 32994.0 | 0.0 | 0.0 | 20.0 | 20.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_020.nt.1 | 32995.0 | 0.0 | 0.0 | 20.0 | 20.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_020.nt.1 | 35546.0 | 3.6 | 3.6 | 20.0 | 20.0 | 7.7 | 7.7 | 8.0 | 8.0
DEX0450_020.nt.2 | 32972.0 | 0.0 | 0.0 | 30.0 | 30.0 | 15.4 | 15.4 | 4.0 | 4.0
DEX0450_020.nt.2 | 32990.0 | 0.0 | 0.0 | 20.0 | 20.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_020.nt.2 | 32991.0 | 0.0 | 0.0 | 20.0 | 20.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_020.nt.2 | 32992.0 | 3.6 | 3.6 | 20.0 | 20.0 | 7.7 | 7.7 | 8.0 | 8.0
DEX0450_020.nt.2 | 32995.0 | 0.0 | 0.0 | 20.0 | 20.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_020.nt.2 | 35546.0 | 3.6 | 3.6 | 20.0 | 20.0 | 7.7 | 7.7 | 8.0 | 8.0
DEX0450_020.nt.3 | 32972.0 | 0.0 | 0.0 | 30.0 | 30.0 | 15.4 | 15.4 | 4.0 | 4.0
DEX0450_020.nt.3 | 32990.0 | 0.0 | 0.0 | 20.0 | 20.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_020.nt.3 | 32991.0 | 0.0 | 0.0 | 20.0 | 20.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_020.nt.3 | 32992.0 | 3.6 | 3.6 | 20.0 | 20.0 | 7.7 | 7.7 | 8.0 | 8.0
DEX0450_020.nt.3 | 32995.0 | 0.0 | 0.0 | 20.0 | 20.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_020.nt.3 | 35546.0 | 3.6 | 3.6 | 20.0 | 20.0 | 7.7 | 7.7 | 8.0 | 8.0
DEX0450_021.nt.1 | 10700.0 | 3.6 | 5.9 | 0.0 | 0.0 | 7.7 | 10.0 | 0.0 | 0.0
DEX0450_021.nt.1 | 10701.0 | 0.0 | 0.0 | 10.0 | 12.5 | 7.7 | 9.1 | 0.0 | 0.0
DEX0450_021.nt.1 | 10744.0 | 10.7 | 11.5 | 30.0 | 33.3 | 30.8 | 30.8 | 8.0 | 9.1
DEX0450_021.nt.1 | 10745.0 | 3.6 | 4.3 | 10.0 | 12.5 | 15.4 | 16.7 | 0.0 | 0.0
DEX0450_022.nt.1 | 12057.0 | 3.6 | 3.8 | 30.0 | 30.0 | 30.8 | 30.8 | 0.0 | 0.0
DEX0450_022.nt.1 | 12058.0 | 0.0 | 0.0 | 10.0 | 10.0 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_023.nt.1 | 8910.0 | 25.0 | 25.0 | 30.0 | 30.0 | 53.8 | 53.8 | 12.0 | 12.0
DEX0450_024.nt.1 | 36564.0 | 28.6 | 28.6 | 50.0 | 50.0 | 61.5 | 61.5 | 20.0 | 20.0
DEX0450_025.nt.1 | 37705.0 | 28.6 | 28.6 | 20.0 | 20.0 | 15.4 | 15.4 | 32.0 | 32.0
DEX0450_025.nt.1 | 37706.0 | 25.0 | 25.0 | 10.0 | 10.0 | 7.7 | 7.7 | 28.0 | 28.0
DEX0450_027.nt.1 | 35470.0 | 32.1 | 32.1 | 40.0 | 40.0 | 30.8 | 30.8 | 36.0 | 36.0
DEX0450_027.nt.1 | 35471.0 | 42.9 | 42.9 | 40.0 | 40.0 | 38.5 | 38.5 | 44.0 | 44.0
DEX0450_027.nt.2 | 35470.0 | 32.1 | 32.1 | 40.0 | 40.0 | 30.8 | 30.8 | 36.0 | 36.0
DEX0450_027.nt.2 | 35471.0 | 42.9 | 42.9 | 40.0 | 40.0 | 38.5 | 38.5 | 44.0 | 44.0
DEX0450_028.nt.1 | 31539.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_028.nt.1 | 31545.0 | 7.1 | 7.1 | 30.0 | 30.0 | 30.8 | 30.8 | 4.0 | 4.0
DEX0450_030.nt.1 | 8410.0 | 14.3 | 14.3 | 10.0 | 10.0 | 30.8 | 30.8 | 4.0 | 4.0
DEX0450_030.nt.1 | 8411.0 | 14.3 | 14.3 | 10.0 | 10.0 | 30.8 | 30.8 | 4.0 | 4.0
DEX0450_031.nt.1 | 21356.0 | 17.9 | 17.9 | 20.0 | 22.2 | 15.4 | 16.7 | 20.0 | 20.0


DEX0450_031.nt.1 | [illegible] | 10.7 | 11.1 | [illegible] | 40.0 | 23.1 | 23.1 | 16.0 | 16.7


DEX0450_031.nt.2 | 21356.0 | 17.9 | 17.9 | 20.0 | 22.2 | 15.4 | 16.7 | 20.0 | 20.0
DEX0450_032.nt.1 | 34888.0 | 14.3 | 14.3 | 30.0 | 30.0 | 38.5 | 38.5 | 8.0 | 8.0
DEX0450_033.nt.1 | 30532.0 | 35.7 | 35.7 | 70.0 | 70.0 | 61.5 | 61.5 | 36.0 | 36.0
DEX0450_033.nt.2 | 34642.0 | 21.4 | 25.0 | 30.0 | 50.0 | 23.1 | 33.3 | 24.0 | 28.6
DEX0450_033.nt.2 | 34643.0 | 10.7 | 12.0 | 50.0 | 62.5 | 7.7 | 10.0 | 28.0 | 30.4
DEX0450_034.nt.1 | 37498.0 | 0.0 | 0.0 | 30.0 | 30.0 | 23.1 | 23.1 | 0.0 | 0.0
DEX0450 035.nt.i: 


>0153 .0 3 


.4.3 3 


.4.8 4 


tO.O 4 


tO.O 3 


8.5 2 


\Q . 5 3 


.2.0 j 


.A . D 


DEX0450 035.nt.i; 


>0154 .0 3 


.0.7 3 


.0.7 C 


).0 c 


).0 


' . 7 


f .7 6 


! . 0 £ 


1 . 0 


DEX0450 036.nt.i: 


17615.0 4 


L6.4 4 


16.4 5 


>0 . 0 E 


50.0 3 


10.8 : 


$0.8 5 


>6 . 0 E 


>6 - 0 


DEX0450 036.nt.i: 


57616 .0 4 


12.9 < 


12.9 E 


50.0 I 


50.0 : 


io.8 : 


50 . 8 E 


52 . 0 E 


52 . 0 


DEX0450 036.nt.i: 


57635.0 : 


$9.3 : 


59.3 5 


50.0 E 


io.o ; 


>3.i : 


23.1 E 


52.0 E 


>2 . 0 


DEX0450 037.nt.i: 


53741.0 ' 


16 .4 * 


18.1 < 


10. 0 ' 


i4 .4 ; 


>3 .1 : 


27 .3 E 


56.0 E 


>6 . 0 


DEX0450 038.nt.i; 


58975 .0 : 


L4.3 : 


L4.3 : 


20.0 : 


20.0 ] 


L5 . 4 : 


L5 . 4 : 


.6.0 : 


.6 . U 


DEX0450 038.nt.l] 


58976.0 : 


L4.3 : 


L4.3 : 


30.0 : 


50.0 ] 


L5.4 : 


L5 .4 * 


20.0 \ 


20.0 


DEX0450_039.nt.1 | 38501.0 | 7.1 | 7.1 | 0.0 | 0.0 | 15.4 | 15.4 | 0.0 | 0.0
DEX0450_039.nt.1 | 38502.0 | 7.1 | 7.1 | 0.0 | 0.0 | 15.4 | 15.4 | 0.0 | 0.0
DEX0450_039.nt.2 | 36599.0 | 3.6 | 3.6 | 0.0 | 0.0 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_039.nt.2 | 38428.0 | 10.7 | 10.7 | 0.0 | 0.0 | 15.4 | 15.4 | 4.0 | 4.0
DEX0450_039.nt.2 | 38499.0 | 10.7 | 10.7 | 0.0 | 0.0 | 15.4 | 15.4 | 4.0 | 4.0
DEX0450_039.nt.2 | 38501.0 | 7.1 | 7.1 | 0.0 | 0.0 | 15.4 | 15.4 | 0.0 | 0.0
DEX0450_039.nt.2 | 38502.0 | 7.1 | 7.1 | 0.0 | 0.0 | 15.4 | 15.4 | 0.0 | 0.0
DEX0450_039.nt.2 | 38509.0 | 7.1 | 7.1 | 0.0 | 0.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_039.nt.2 | 38510.0 | 3.6 | 3.6 | 0.0 | 0.0 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_039.nt.3 | 36599.0 | 3.6 | 3.6 | 0.0 | 0.0 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_039.nt.3 | 38428.0 | 10.7 | 10.7 | 0.0 | 0.0 | 15.4 | 15.4 | 4.0 | 4.0
DEX0450_039.nt.3 | 38499.0 | 10.7 | 10.7 | 0.0 | 0.0 | 15.4 | 15.4 | 4.0 | 4.0
DEX0450_039.nt.3 | 38501.0 | 7.1 | 7.1 | 0.0 | 0.0 | 15.4 | 15.4 | 0.0 | 0.0
DEX0450_039.nt.3 | 38509.0 | 7.1 | 7.1 | 0.0 | 0.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_039.nt.3 | 38510.0 | 3.6 | 3.6 | 0.0 | 0.0 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_039.nt.4 | 36599.0 | 3.6 | 3.6 | 0.0 | 0.0 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_039.nt.4 | 38428.0 | 10.7 | 10.7 | 0.0 | 0.0 | 15.4 | 15.4 | 4.0 | 4.0
DEX0450_039.nt.4 | 38499.0 | 10.7 | 10.7 | 0.0 | 0.0 | 15.4 | 15.4 | 4.0 | 4.0
DEX0450_039.nt.4 | 38501.0 | 7.1 | 7.1 | 0.0 | 0.0 | 15.4 | 15.4 | 0.0 | 0.0
DEX0450_039.nt.4 | 38502.0 | 7.1 | 7.1 | 0.0 | 0.0 | 15.4 | 15.4 | 0.0 | 0.0
DEX0450_039.nt.4 | 38509.0 | 7.1 | 7.1 | 0.0 | 0.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_039.nt.4 | 38510.0 | 3.6 | 3.6 | 0.0 | 0.0 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_039.nt.5 | 38501.0 | 7.1 | 7.1 | 0.0 | 0.0 | 15.4 | 15.4 | 0.0 | 0.0
DEX0450_039.nt.5 | 38502.0 | 7.1 | 7.1 | 0.0 | 0.0 | 15.4 | 15.4 | 0.0 | 0.0
DEX0450_039.nt.6 | 38501.0 | 7.1 | 7.1 | 0.0 | 0.0 | 15.4 | 15.4 | 0.0 | 0.0
DEX0450_039.nt.6 | 38502.0 | 7.1 | 7.1 | 0.0 | 0.0 | 15.4 | 15.4 | 0.0 | 0.0


DEX0450_040.nt.1 | 35520.0 | 46.4 | 46.4 | 20.0 | 20.0 | 46.2 | 46.2 | 36.0 | 36.0
DEX0450_040.nt.1 | 35521.0 | 46.4 | 46.4 | 30.0 | 30.0 | 53.8 | 53.8 | 36.0 | 36.0
DEX0450_040.nt.1 | 39581.0 | 35.7 | 38.5 | 30.0 | 30.0 | 46.2 | 50.0 | 28.0 | 29.2
DEX0450_040.nt.1 | 39582.0 | 46.4 | 46.4 | 30.0 | 30.0 | 53.8 | 53.8 | 36.0 | 36.0
DEX0450_041.nt.1 | 33072.0 | 0.0 | 0.0 | 10.0 | 10.0 | 0.0 | 0.0 | 4.0 | 4.0
DEX0450_041.nt.1 | 33074.0 | 7.1 | 7.4 | 50.0 | 50.0 | 38.5 | 38.5 | 8.0 | 8.3
DEX0450_041.nt.1 | 33075.0 | 3.6 | 3.6 | 40.0 | 40.0 | 30.8 | 30.8 | 4.0 | 4.0
DEX0450_041.nt.1 | 36755.0 | 0.0 | 0.0 | 20.0 | 20.0 | 15.4 | 15.4 | 0.0 | 0.0
DEX0450_041.nt.1 | 36756.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_041.nt.1 | 37290.0 | 3.6 | 3.6 | 10.0 | 10.0 | 7.7 | 7.7 | 4.0 | 4.0
DEX0450_042.nt.1 | 33118.0 | 3.6 | 3.6 | 20.0 | 20.0 | 23.1 | 23.1 | 0.0 | 0.0
DEX0450_042.nt.1 | 33119.0 | 3.6 | 3.6 | 10.0 | 10.0 | 15.4 | 15.4 | 0.0 | 0.0


DEX0450 042. nt. 3 


.34009.0 


21.4 


21.4 


40.0 


44 .4 


53 . 8 


53 . 8 


12 . 0 


12 . b 


DEX0450 042. nt. 3 


.40737.0 


10.7 


10.7 


20.0 


20.0 


30.8 


30 . 8 


4 . 0 


A A 

4 . 0 


DEX0450 042. nt. 2 


133118.0 


3.6 


3.6 


20.0 


20.0 


23 . 1 


23 . 1 


0 . 0 


r\ a 

0 . 0 


DEX0450 042. nt . 1 


>33119.C 


3.6 


3.6 


10.0 


10.0 


15 .4 


15 .4 


0 - 0 


0 . 0 


DEX0450 042. nt . 1 


I 34009 . C 


> 21 . 4 


11 A 




44 4 


53 - 8 


53 . 8 


12 . 0 


12 .5 


DEX0450 042. nt.: 


>40737.C 


1 10.7 


10.7 


20.0 


20.0 


30.8 


30.8 


4.0 


4.0 


DEX0450 043. nt.] 


L35881.C 


) 0.0 


0.0 


10.0 


12.5 


7.7 


9.1 


0.0 


0.0 


DEX0450 043. nt.] 


L35882.C 


) 3.6 


25.0 


0.0 


0.0 


0.0 


0.0 


4.0 


25.0 


DEX0450 043. nt.] 


L36669.C 


) 25.0 


25.0 


40.0 


40.0 


38.5 


38.5 


24.0 


24 .0 


DEX0450 043. nt.: 


L36670.C 


) 25.0 


25.0 


40.0 


40.0 


38.5 


38.5 


24.0 


24.0 


DEX0450 044. nt.: 


L9110.0 


28.6 


28.6 


30.0 


30.0 


23 .1 


23.1 


32.0 


32.0 
DEX0450_045.nt.1 | 13783.0 | 14.3 | 14.3 | 20.0 | 20.0 | 30.8 | 30.8 | 8.0 | 8.0
DEX0450_045.nt.1 | 13784.0 | 14.3 | 14.3 | 10.0 | 10.0 | 30.8 | 30.8 | 4.0 | 4.0
DEX0450_046.nt.1 | 10297.0 | 53.6 | 53.6 | 90.0 | 90.0 | 84.6 | 84.6 | 52.0 | 52.0
DEX0450_047.nt.1 | 30511.0 | 14.3 | 14.8 | 50.0 | 50.0 | 53.8 | 53.8 | 8.0 | 8.3
DEX0450_047.nt.1 | 38789.0 | 7.1 | 7.1 | 10.0 | 10.0 | 15.4 | 15.4 | 4.0 | 4.0
DEX0450_047.nt.1 | 39412.0 | 7.1 | 7.1 | 10.0 | 10.0 | 15.4 | 15.4 | 4.0 | 4.0
DEX0450_048.nt.1 | 41177.0 | 3.6 | 3.6 | 20.0 | 20.0 | 15.4 | 15.4 | 4.0 | 4.0
DEX0450_048.nt.1 | 41178.0 | 3.6 | 3.7 | 30.0 | 30.0 | 23.1 | 23.1 | 4.0 | 4.2
DEX0450_049.nt.1 | 31424.0 | 7.1 | 8.0 | 20.0 | 22.2 | 23.1 | 23.1 | 4.0 | 4.8
DEX0450_049.nt.1 | 31425.0 | 3.6 | 4.0 | 30.0 | 37.5 | 30.8 | 30.8 | 0.0 | 0.0
DEX0450_050.nt.1 | 9398.0 | 28.6 | 28.6 | 20.0 | 20.0 | 30.8 | 30.8 | 24.0 | 24.0
DEX0450_050.nt.1 | 9399.0 | 28.6 | 28.6 | 20.0 | 20.0 | 30.8 | 30.8 | 24.0 | 24.0
DEX0450_052.nt.1 | 37943.0 | 46.4 | 46.4 | 70.0 | 70.0 | 76.9 | 76.9 | 40.0 | 40.0
DEX0450_052.nt.1 | 37944.0 | 42.9 | 44.4 | 70.0 | 70.0 | 76.9 | 76.9 | 36.0 | 37.5
DEX0450_053.nt.1 | 9398.0 | 28.6 | 28.6 | 20.0 | 20.0 | 30.8 | 30.8 | 24.0 | 24.0
DEX0450_053.nt.1 | 9399.0 | 28.6 | 28.6 | 20.0 | 20.0 | 30.8 | 30.8 | 24.0 | 24.0



Table 3. 



DEX ID | Oligo Name | Cln Multi-Cancer ALL %up n=27 | Cln Multi-Cancer ALL %valid up n=27 | Cln Multi-Cancer ASC %up n=14 | Cln Multi-Cancer ASC %valid up n=14 | Cln Multi-Cancer RS %up n=13 | Cln Multi-Cancer RS %valid
BREAST CANCER CHIPS 
For breast cancer, two different chip designs were evaluated with overlapping sets 
of a total of 36 samples, comparing the expression patterns of breast cancer derived 
polyA+ RNA to polyA+ RNA isolated from a pool of 10 normal breast tissues. For the 
Breast Array Chip, all 36 samples (9 stage I cancers, 23 stage II cancers, 4 stage III 
cancers) were analyzed. These samples also represented 10 Grade 1/2 and 26 Grade 3 
cancers. The histopathologic grades for cancer are classified as follows: GX, cannot be 
assessed; G1, well differentiated; G2, moderately differentiated; G3, poorly differentiated; 
and G4, undifferentiated. AJCC Cancer Staging Handbook, p. 9 (5th Ed., 1998). 
Samples were further grouped based on the expression patterns of the known breast cancer 
associated genes Her2 and ERα (10 HER2 up, 26 HER2 not up, 20 ER up and 16 ER not 
up). For the Multi-Cancer Array Chip, a subset of 20 of these samples (9 stage I 
cancers, 8 stage II cancers, 3 stage III cancers) was assessed. 

The results for the statistically significant up-regulated genes on the Breast Array 
Chip are shown in Tables 4 and 5. The results for the statistically significant 
up-regulated genes on the Multi-Cancer Array Chip are shown in Table 6. The first two 
columns of each table contain information about the sequence itself (Seq ID, Oligo 
Name), the next columns show the results obtained for all ("ALL") breast cancer 
samples, cancers corresponding to stage I ("ST1"), stages II and III ("ST2,3"), grades 1 
and 2 ("GR1,2"), grade 3 ("GR3"), cancers exhibiting up-regulation of Her2 ("HER2up") 
or ERα ("ERup") or those not exhibiting up-regulation of Her2 ("NOT HER2up") or ERα 
("NOT ERup"). '%up' indicates the percentage of all experiments in which 
up-regulation of at least 2-fold was observed (n=36 for the Breast Array Chip, n=20 for 
the Multi-Cancer Array Chip), '%valid up' indicates the percentage of experiments with 
valid expression values in which up-regulation of at least 2-fold was observed. 
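
The two percentages defined above can be illustrated with a minimal sketch. It is not taken from the specification: the function name `percent_up`, the use of `None` to mark an experiment without a valid expression value, and the rounding to one decimal place are assumptions made for illustration only.

```python
from typing import Optional, Sequence, Tuple


def percent_up(
    fold_changes: Sequence[Optional[float]], threshold: float = 2.0
) -> Tuple[float, float]:
    """Return (%up, %valid up), each rounded to one decimal place.

    fold_changes holds one cancer/normal expression ratio per experiment.
    None marks an experiment without a valid expression value (an
    assumption about how invalid measurements are represented).
    """
    n_all = len(fold_changes)
    valid = [fc for fc in fold_changes if fc is not None]
    # Count experiments showing at least `threshold`-fold up-regulation.
    n_up = sum(1 for fc in valid if fc >= threshold)
    # %up uses all experiments as the denominator.
    pct_up = 100.0 * n_up / n_all if n_all else 0.0
    # %valid up uses only experiments with valid expression values.
    pct_valid_up = 100.0 * n_up / len(valid) if valid else 0.0
    return round(pct_up, 1), round(pct_valid_up, 1)


# Hypothetical data: 36 experiments, all valid, one up-regulated >= 2-fold.
print(percent_up([2.4] + [1.0] * 35))  # (2.8, 2.8)
```

Under these assumptions, the two values differ only when some experiments lack a valid expression value, in which case '%valid up' is the larger of the two.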
0.0 | 0.0 | 0.0 | 0.0


DEX0450_032.nt.6 | 19576.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 20237.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 31590.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 31591.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 15315.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 15317.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 15320.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 15324.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 15848.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 15865.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 16906.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 16908.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 16912.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 19576.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 20237.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 31590.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 31591.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_044.nt.1 | 24210.0 | 2.8 | 2.8 | 11.1 | 11.1 | 0.0 | 0.0 | 0.0 | 0.0 | 3.8 | 3.8
DEX0450_044.nt.1 | 24211.0 | 5.6 | 5.6 | 11.1 | 11.1 | 3.7 | 3.7 | 0.0 | 0.0 | 7.7 | 7.7
DEX0450_044.nt.2 | 24210.0 | 2.8 | 2.8 | 11.1 | 11.1 | 0.0 | 0.0 | 0.0 | 0.0 | 3.8 | 3.8
DEX0450_044.nt.2 | 24211.0 | 5.6 | 5.6 | 11.1 | 11.1 | 3.7 | 3.7 | 0.0 | 0.0 | 7.7 | 7.7
DEX0450_044.nt.3 | 24210.0 | 2.8 | 2.8 | 11.1 | 11.1 | 0.0 | 0.0 | 0.0 | 0.0 | 3.8 | 3.8
DEX0450_044.nt.3 | 24211.0 | 5.6 | 5.6 | 11.1 | 11.1 | 3.7 | 3.7 | 0.0 | 0.0 | 7.7 | 7.7
DEX0450_045.nt.1 | 19853.0 | 5.6 | 5.6 | 0.0 | 0.0 | 7.4 | 7.4 | 0.0 | 0.0 | 7.7 | 7.7
DEX0450_045.nt.1 | 19854.0 | 2.8 | 2.8 | 0.0 | 0.0 | 3.7 | 3.7 | 0.0 | 0.0 | 3.8 | 3.8
DEX0450_053.nt.1 | 31644.0 | 19.4 | 19.4 | 22.2 | 22.2 | 18.5 | 18.5 | 20.0 | 20.0 | 19.2 | 19.2
DEX0450_053.nt.1 | 31645.0 | 19.4 | 19.4 | 22.2 | 22.2 | 18.5 | 18.5 | 20.0 | 20.0 | 19.2 | 19.2



Table 5. 



DEX ID | Oligo Name | Mam HER2up %up n=10 | Mam HER2up %valid up n=10 | Mam NOT HER2up %up n=26 | Mam NOT HER2up %valid up n=26 | Mam ERup %up n=20 | Mam ERup %valid up n=20 | Mam NOT ERup %up n=16 | Mam NOT ERup %valid up n=16
DEX0450_010.nt.1 | 17829.0 | 0.0 | 0.0 | 7.7 | 8.0 | 0.0 | 0.0 | 12.5 | 13.3
DEX0450_010.nt.1 | 22376.0 | 0.0 | 0.0 | 11.5 | 12.0 | 0.0 | 0.0 | 18.8 | 20.0
DEX0450_010.nt.3 | 17829.0 | 0.0 | 0.0 | 7.7 | 8.0 | 0.0 | 0.0 | 12.5 | 13.3
DEX0450_010.nt.3 | 22376.0 | 0.0 | 0.0 | 11.5 | 12.0 | 0.0 | 0.0 | 18.8 | 20.0
DEX0450_023.nt.1 | 22060.0 | 20.0 | 20.0 | 3.8 | 3.8 | 5.0 | 5.0 | 12.5 | 12.5
DEX0450_031.nt.1 | 17869.0 | 0.0 | 0.0 | 11.5 | 11.5 | 0.0 | 0.0 | 18.8 | 18.8
DEX0450_031.nt.1 | 31466.0 | 0.0 | 0.0 | 11.5 | 11.5 | 0.0 | 0.0 | 18.8 | 18.8
DEX0450_032.nt.1 | 15848.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.1 | 16908.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.1 | 16912.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.1 | 19576.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 15315.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 15317.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 15320.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 15324.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 15829.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 15830.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 15848.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 15865.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 16111.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 16906.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 16908.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 16910.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 16912.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 19576.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 20237.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 31590.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 31591.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 15315.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 15317.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 15320.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 15324.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 15829.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 15830.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 15848.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 15865.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 16111.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 16906.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 16908.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 16910.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 16912.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 19576.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 20237.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 31590.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.3 | 31591.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 15315.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 15317.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 15320.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 15324.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 15829.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 15830.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 15848.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 15865.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 16111.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 16906.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 16908.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 16910.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 16912.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 19576.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 20237.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 31590.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.4 | 31591.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 15315.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 15317.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 15320.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 15324.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 15829.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 15830.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 15848.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 15865.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 16111.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 16906.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 16908.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 16910.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 16912.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 19576.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 20237.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 31590.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.5 | 31591.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 15315.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 15320.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 15324.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 15829.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 15830.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 15848.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 15865.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 16111.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 16906.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 16908.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 16910.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 16912.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 19576.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 20237.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 31590.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.6 | 31591.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 15315.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 15317.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 15320.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 15324.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 15848.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 15865.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 16906.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 16908.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 16912.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 19576.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 20237.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 31590.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.7 | 31591.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_044.nt.1 | 24210.0 | 0.0 | 0.0 | 3.8 | 3.8 | 0.0 | 0.0 | 6.2 | 6.2
DEX0450_044.nt.1 | 24211.0 | 10.0 | 10.0 | 3.8 | 3.8 | 5.0 | 5.0 | 6.2 | 6.2
DEX0450_044.nt.2 | 24210.0 | 0.0 | 0.0 | 3.8 | 3.8 | 0.0 | 0.0 | 6.2 | 6.2
DEX0450_044.nt.2 | 24211.0 | 10.0 | 10.0 | 3.8 | 3.8 | 5.0 | 5.0 | 6.2 | 6.2
DEX0450_044.nt.3 | 24210.0 | 0.0 | 0.0 | 3.8 | 3.8 | 0.0 | 0.0 | 6.2 | 6.2
DEX0450_044.nt.3 | 24211.0 | 10.0 | 10.0 | 3.8 | 3.8 | 5.0 | 5.0 | 6.2 | 6.2
DEX0450_045.nt.1 | 19853.0 | 0.0 | 0.0 | 7.7 | 7.7 | 0.0 | 0.0 | 12.5 | 12.5
DEX0450_045.nt.1 | 19854.0 | 0.0 | 0.0 | 3.8 | 3.8 | 0.0 | 0.0 | 6.2 | 6.2
DEX0450_053.nt.1 | 31644.0 | 0.0 | 0.0 | 26.9 | 26.9 | 15.0 | 15.0 | 25.0 | 25.0
DEX0450_053.nt.1 | 31645.0 | 0.0 | 0.0 | 26.9 | 26.9 | 15.0 | 15.0 | 25.0 | 25.0



Table 6. 



DEX ID | Oligo Name | Mam Multi-Cancer ALL %up n=20 | Mam Multi-Cancer ALL %valid up n=20 | Mam Multi-Cancer ST1 %up n=9 | Mam Multi-Cancer ST1 %valid up n=9 | Mam Multi-Cancer ST2,3 %up n=11 | Mam Multi-Cancer ST2,3 %valid up n=11
DEX0450_025.nt.1 | 77759.0 | 15.0 | 15.0 | 11.1 | 11.1 | 18.2 | 18.2
DEX0450_025.nt.1 | 77759.1 | 15.0 | 15.0 | 11.1 | 11.1 | 18.2 | 18.2
DEX0450_031.nt.1 | 956.0 | 5.0 | 5.0 | 11.1 | 11.1 | 0.0 | 0.0

LUNG CANCER CHIPS 
For lung cancer two different chip designs were evaluated with overlapping sets of a
total of 29 samples, comparing the expression patterns of lung cancer derived polyA+
RNA to polyA+ RNA isolated from a pool of 12 normal lung tissues. For the Lung Array
Chip all 29 samples (15 squamous cell carcinomas and 14 adenocarcinomas, including 14
stage I and 15 stage II/III cancers) were analyzed, and for the Multi-Cancer Array Chip a
subset of 22 of these samples (10 squamous cell carcinomas, 12 adenocarcinomas) was
assessed.

The results for the statistically significant up-regulated genes on the Lung Array
Chip are shown in Table 7. The results for the statistically significant up-regulated genes
on the Multi-Cancer Array Chip are shown in Table 8. The first two columns of each
table contain information about the sequence itself (DEX ID, Oligo Name); the next
columns show the results obtained for all ("ALL") lung cancer samples, squamous cell
carcinomas ("SQ"), adenocarcinomas ("AD"), or cancers corresponding to stage I
("ST1") or stages II and III ("ST2,3"). '%up' indicates the percentage of all experiments
in which up-regulation of at least 2-fold was observed (n=29 for Lung Array Chip, n=22
for Multi-Cancer Array Chip); '%valid up' indicates the percentage of experiments with
valid expression values in which up-regulation of at least 2-fold was observed.
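The difference between the two percentages can be illustrated with a minimal sketch. It assumes that each experiment yields either a tumor/normal expression ratio or no valid value, encoded here as None; that encoding and the function name percent_up are illustrative choices, not taken from the specification.

```python
from typing import Optional, Sequence, Tuple

FOLD_THRESHOLD = 2.0  # "at least 2-fold", as stated in the text


def percent_up(fold_changes: Sequence[Optional[float]],
               threshold: float = FOLD_THRESHOLD) -> Tuple[float, float]:
    """Return (%up, %valid up) for one oligo across a set of experiments.

    Each entry is a tumor/normal expression ratio, or None when the
    experiment gave no valid expression value (hypothetical encoding).
    """
    total = len(fold_changes)
    valid = [fc for fc in fold_changes if fc is not None]
    up = sum(1 for fc in valid if fc >= threshold)
    # %up uses all experiments as the denominator
    pct_up = 100.0 * up / total if total else 0.0
    # %valid up uses only experiments with a valid expression value
    pct_valid_up = 100.0 * up / len(valid) if valid else 0.0
    return round(pct_up, 1), round(pct_valid_up, 1)


# 22 experiments: 7 up-regulated, 14 valid but not up, 1 without a valid value
example = [2.5] * 7 + [1.1] * 14 + [None]
print(percent_up(example))  # (31.8, 33.3)
```

With these assumed counts the sketch gives 31.8 and 33.3, the same pair reported for one oligo in Table 8; the underlying counts are inferred for illustration and are not stated in the specification.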



Table 7. 



DEX ID | Oligo Name | Lng ALL %up n=29 | Lng ALL %valid up n=29 | Lng SQ %up n=15 | Lng SQ %valid up n=15 | Lng AD %up n=14 | Lng AD %valid up n=14 | Lng ST1 %up n=14 | Lng ST1 %valid up n=14 | Lng ST2,3 %up n=15 | Lng ST2,3 %valid up n=15
DEX0450_001.nt.1 | 2383.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_001.nt.1 | 2400.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_001.nt.1 | 2404.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_001.nt.1 | 4831.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_001.nt.1 | 4832.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_031.nt.1 | 835.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_031.nt.1 | 836.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_031.nt.1 | 953.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_031.nt.1 | 955.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_031.nt.1 | 957.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_031.nt.1 | 5254.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_031.nt.1 | 5749.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_031.nt.1 | 5750.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_031.nt.2 | 957.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_031.nt.2 | 5254.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_031.nt.2 | 5749.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_031.nt.2 | 5750.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_033.nt.1 | 1234.0 | 51.7 | 51.7 | 80.0 | 80.0 | 21.4 | 21.4 | 57.1 | 57.1 | 46.7 | 46.7
DEX0450_033.nt.2 | 5421.0 | 44.8 | 44.8 | 33.3 | 33.3 | 57.1 | 57.1 | 35.7 | 35.7 | 53.3 | 53.3
DEX0450_036.nt.1 | 5961.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_037.nt.1 | 1022.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_037.nt.1 | 1023.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_045.nt.1 | 5599.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0



Table 8. 



DEX ID | Oligo Name | Lng Multi-Cancer ALL %up n=22 | Lng Multi-Cancer ALL %valid up n=22 | Lng Multi-Cancer SQ %up n=10 | Lng Multi-Cancer SQ %valid up n=10 | Lng Multi-Cancer AD %up n=12 | Lng Multi-Cancer AD %valid up n=12
DEX0450_025.nt.1 | 77759.0 | 31.8 | 33.3 | 30.0 | 30.0 | 33.3 | 36.4
DEX0450_025.nt.1 | 77759.1 | 50.0 | 50.0 | 50.0 | 50.0 | 50.0 | 50.0
DEX0450_031.nt.1 | 956.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0


OVARIAN CANCER CHIPS 
For ovarian cancer two different chip designs were evaluated with overlapping sets
of a total of 19 samples, comparing the expression patterns of ovarian cancer derived total
RNA to total RNA isolated from a pool of 9 normal ovarian tissues. For the Multi-Cancer
Array Chip, all 19 samples (14 invasive carcinomas, 5 low malignant potential samples)
were analyzed, and for the Ovarian Array Chip, a subset of 17 of these samples (13
invasive carcinomas, 4 low malignant potential samples) was assessed.

The results for the statistically significant up-regulated genes on the Ovarian Array
Chip are shown in Table 9. The results for the Multi-Cancer Array Chip are shown in
Table 10. The first two columns of each table contain information about the sequence
itself (DEX ID, Oligo Name); the next columns show the results obtained for all ("ALL")
ovarian cancer samples, invasive carcinomas ("INV") and low malignant potential
("LMP") samples. '%up' indicates the percentage of all experiments in which
up-regulation of at least 2-fold was observed (n=19 for the Multi-Cancer Array Chip, n=17
for the Ovarian Array Chip); '%valid up' indicates the percentage of experiments with
valid expression values in which up-regulation of at least 2-fold was observed.
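Because the ALL column pools the INV and LMP samples, the number of up-regulated experiments implied by the ALL percentage should equal the sum of the numbers implied by the two subgroup percentages. The following is a minimal sketch of that arithmetic check for the Ovarian Array Chip sample sizes given above (17 = 13 + 4). The function names are hypothetical, and the percentages used are illustrative values of the kind reported in Table 9.

```python
# Ovarian Array Chip sample sizes from the text: 17 = 13 invasive + 4 LMP
N_ALL, N_INV, N_LMP = 17, 13, 4


def count_from_percent(percent: float, n: int) -> int:
    """Recover the number of up-regulated experiments from a %up value."""
    return round(percent * n / 100.0)


def subgroups_consistent(pct_all: float, pct_inv: float, pct_lmp: float) -> bool:
    """Check that the ALL count equals the INV count plus the LMP count."""
    return count_from_percent(pct_all, N_ALL) == (
        count_from_percent(pct_inv, N_INV) + count_from_percent(pct_lmp, N_LMP)
    )


# 88.2% of 17 -> 15 experiments; 92.3% of 13 -> 12; 75.0% of 4 -> 3
print(subgroups_consistent(88.2, 92.3, 75.0))  # True
```

The check applies only to the '%up' columns, whose denominators are the fixed sample sizes; the '%valid up' denominators vary per oligo and cannot be recovered this way.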
Table 9. 



DEX ID | Oligo Name | Ovr ALL %up n=17 | Ovr ALL %valid up n=17 | Ovr INV %up n=13 | Ovr INV %valid up n=13 | Ovr LMP %up n=4 | Ovr LMP %valid up n=4
DEX0450_029.nt.1 | 37947.01 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_029.nt.1 | 37947.02 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_031.nt.1 | 9880.01 | 17.6 | 21.4 | 15.4 | 18.2 | 25.0 | 33.3
DEX0450_031.nt.1 | 9880.02 | 17.6 | 18.8 | 15.4 | 16.7 | 25.0 | 25.0
DEX0450_033.nt.1 | 18480.01 | 88.2 | 88.2 | 92.3 | 92.3 | 75.0 | 75.0
DEX0450_033.nt.1 | 18480.02 | 88.2 | 88.2 | 92.3 | 92.3 | 75.0 | 75.0
DEX0450_033.nt.2 | 18480.01 | 88.2 | 88.2 | 92.3 | 92.3 | 75.0 | 75.0
DEX0450_033.nt.2 | 18480.02 | 88.2 | 88.2 | 92.3 | 92.3 | 75.0 | 75.0
DEX0450_037.nt.1 | 12335.01 | 5.9 | 5.9 | 7.7 | 7.7 | 0.0 | 0.0
DEX0450_037.nt.1 | 12335.02 | 5.9 | 5.9 | 7.7 | 7.7 | 0.0 | 0.0



Table 10.

DEX ID | Oligo Name | Ovr Multi-Cancer ALL %up n=19 | Ovr Multi-Cancer ALL %valid up n=19 | Ovr Multi-Cancer INV %up n=14 | Ovr Multi-Cancer INV %valid up n=14 | Ovr Multi-Cancer LMP %up n=5 | Ovr Multi-Cancer LMP %valid up n=5
DEX0450_025.nt.1 | 77759.0 | 5.3 | 5.9 | 7.1 | [illegible] | 0.0 | 0.0
DEX0450_025.nt.1 | 77759.1 | 5.3 | 6.2 | 7.1 | [illegible] | 0.0 | 0.0
DEX0450_031.nt.1 | 956.0 | 10.5 | 15.4 | 7.1 | 11.1 | 20.0 | 25.0



PROSTATE CANCER 

For prostate cancer three different chip designs were evaluated with overlapping
sets of a total of 29 samples, comparing the expression patterns of prostate cancer or
benign disease derived total RNA to total RNA isolated from a pool of 35 normal prostate
tissues. For the Prostate1 Array and Prostate2 Array Chips all 29 samples (17 prostate
cancer samples, 12 non-malignant disease samples) were analyzed. For the Multi-Cancer
Array Chip a subset of 28 of these samples (16 prostate cancer samples, 12 non-malignant
disease samples) was analyzed.

The results for the statistically significant up-regulated genes on the Prostate1
Array Chip and the Prostate2 Array Chip are shown in Table 11. The results for the
statistically significant up-regulated genes on the Multi-Cancer Array Chip are shown in
Table 12. The first two columns of each table contain information about the sequence
itself (DEX ID, Oligo Name); the next columns show the results obtained for prostate
cancer samples ("CAN") or non-malignant disease samples ("DIS"). '%up' indicates the
percentage of all experiments in which up-regulation of at least 2-fold was observed (n=29
for the Prostate2 Array Chip and the Multi-Cancer Array Chip); '%valid up' indicates the
percentage of experiments with valid expression values in which up-regulation of at least
2-fold was observed.

Table 11.

DEX ID | Oligo Name | Pro CAN %up n=17 | Pro CAN %valid up n=17 | Pro DIS %up n=12 | Pro DIS %valid up n=12
DEX0450_031.nt.1 | 38723.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_031.nt.1 | 38723.03 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.1 | 32036.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.1 | 32036.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.1 | 35907.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.1 | 35907.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.1 | 35907.03 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.1 | 35945.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.1 | 35945.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.1 | 35945.03 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.1 | 35955.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.1 | 35955.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.1 | 35955.03 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 26953.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 26953.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 31256.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 31256.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 31256.03 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 32020.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 32020.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 32036.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 32036.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35897.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35897.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35897.03 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35901.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35901.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35901.03 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35905.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35905.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35905.03 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35907.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35907.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35907.03 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35919.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35919.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35919.03 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35945.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35945.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35945.03 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35951.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35951.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35951.03 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35955.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35955.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35955.03 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35957.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35957.02 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35957.03 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35959.01 | 0.0 | 0.0 | 0.0 | 0.0
DEX0450_032.nt.2 | 35959.02 | 0.0 | 0.0 | 0.0 | 0.0


DEX0450 032.nt.2

Table 12.



DEX ID 


Oligo 
Name 


Pro Multi- 
Cancer CAN 
%up n=16 


Pro Multi- 
Cancer CAN 
% valid up 
n=16 


Pro Multi- 
Cancer DIS 
%up n=12
0.0


DEX0450 031.nt.l 


956.0 


0.0 


0.0 


0.0 


0.0 






SEQ ID NO: 1-94 were up-regulated on various tissue microarrays. Accordingly,
nucleotide SEQ ID NO: 1-94 or the encoded protein SEQ ID NO: 95-248 may be used as
a cancer therapeutic and/or diagnostic target for the tissues in which expression is shown.

The following table lists the location (Oligo Location) where the microarray oligos
(Oligo ID) map on the transcripts (DEX ID) of the present invention. Each Oligo ID may
have been printed multiple times on a single chip as replicates. The Oligo Name is an
exemplary replicate (e.g. 1000.01) for the Oligo ID (e.g. 1000), and data from other
replicates (e.g. 1000.02, 1000.03) may be reported. Additionally, the Array (Chip Name)
that each oligo and oligo replicates were printed on is included.
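The naming convention described above can be sketched in a few lines. The sketch assumes that an Oligo Name is always the Oligo ID followed by a period and a replicate suffix, and that an Oligo Location is an inclusive "start-end" range on the transcript. The function names are hypothetical and are not part of the specification.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def split_oligo_name(oligo_name: str) -> Tuple[str, str]:
    """Split an Oligo Name such as '1000.01' into (Oligo ID, replicate suffix)."""
    oligo_id, _, replicate = oligo_name.strip().partition(".")
    return oligo_id, replicate


def group_replicates(oligo_names: List[str]) -> Dict[str, List[str]]:
    """Group replicate Oligo Names under their shared Oligo ID."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for name in oligo_names:
        oligo_id, _ = split_oligo_name(name)
        groups[oligo_id].append(name)
    return dict(groups)


def location_length(location: str) -> int:
    """Length in nucleotides of an inclusive 'start-end' Oligo Location."""
    start, end = (int(part) for part in location.split("-"))
    return end - start + 1


print(group_replicates(["1000.01", "1000.02", "1000.03", "2383.0"]))
print(location_length("1925-1984"))  # 60
```

Under the inclusive-range assumption, a location such as 1925-1984 spans 60 nucleotides, which gives a simple way to spot mis-transcribed coordinates in the table that follows.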



DEX ID | Oligo ID | Oligo Name | Chip Name | Oligo Location
DEX0450_001.nt.1 | 2383 | 2383.0 | Lung array | 1925-1984
DEX0450_001.nt.1 | 4832 | 4832.0 | Lung array | 1101-1160
DEX0450_001.nt.1 | 4831 | 4831.0 | Lung array | 376-435
DEX0450_001.nt.1 | 2404 | 2404.0 | Lung array | 1925-1984
DEX0450_001.nt.1 | 2405 | 2405.0 | Multi-Cancer array | 1918-1977
DEX0450_001.nt.1 | 2400 | 2400.0 | Lung array | 611-670
DEX0450_002.nt.1 | 31163 | 31163.0 | Colon array | 1644-1703
DEX0450_002.nt.1 | 34075 | 34075.0 | Colon array | 2521-2580
DEX0450_002.nt.1 | 31158 | 31158.0 | Colon array | 2221-2280
DEX0450_002.nt.1 | 31162 | 31162.0 | Colon array | 1666-1725
DEX0450_002.nt.1 | 31003 | 31003.0 | Colon array | 1156-1215
DEX0450_002.nt.1 | 34074 | 34074.0 | Colon array | 2597-2656
DEX0450_002.nt.1 | 31159 | 31159.0 | Colon array | 2168-2227
DEX0450_003.nt.1 | 31446 | 31446.0 | Colon array | 37-96
DEX0450_003.nt.2 | 31447 | 31447.0 | Colon array | 3391-3450
DEX0450_003.nt.2 | 31446 | 31446.0 | Colon array | 3635-3694
DEX0450_003.nt.3 | 31447 | 31447.0 | Colon array | 2288-2347
DEX0450_003.nt.3 | 31443 | 31443.0 | Colon array | 1628-1687
DEX0450_004.nt.1 | 10720 | 10720.0 | Colon array | 260-319
DEX0450_004.nt.1 | 10721 | 10721.0 | Colon array | 220-279
DEX0450_005.nt.1 | 38049 | 38049.0 | Colon array | 2673-2732
DEX0450_005.nt.1 | 16776 | 16776.0 | Colon array | 422-481
DEX0450_005.nt.1 | 16777 | 16777.0 | Colon array | 336-395
DEX0450_005.nt.1 | 38050 | 38050.0 | Colon array | 2564-2623
DEX0450_006.nt.1 | 35170 | 35170.0 | Colon array | 511-570
DEX0450_006.nt.1 | 35171 | 35171.0 | Colon array | 461-520
DEX0450_007.nt.1 | 35170 | 35170.0 | Colon array | 3353-3412
DEX0450_008.nt.1 | 30228 | 30228.0 | Colon array | 2517-2576
DEX0450_008.nt.1 | 30227 | 30227.0 | Colon array | 2563-2622
DEX0450_009.nt.1 | 39839 | 39839.0 | Colon array | 975-1034
DEX0450_009.nt.1 | 39840 | 39840.0 | Colon array | 822-881
DEX0450_009.nt.2 | 39839 | 39839.0 | Colon array | 1702-1761
DEX0450_009.nt.2 | 39840 | 39840.0 | Colon array | 1549-1608
DEX0450_010.nt.1 | 29571 | 29571.0 | Colon array | 79-138
DEX0450_010.nt.1 | 29612 | 29612.0 | Colon array | 456-515
DEX0450_010.nt.1 | 29595 | 29595.0 | Colon array | 493-552
DEX0450_010.nt.1 | 29609 | 29609.0 | Colon array | 486-545
DEX0450_010.nt.1 | 22376 | 22376.0 | Breast array | 482-541
DEX0450_010.nt.1 | 29582 | 29582.0 | Colon array | 482-541
DEX0450_010.nt.1 | 29611 | 29611.0 | Colon array | 493-552
DEX0450_010.nt.1 | 17829 | 17829.0 | Breast array | 87-146
DEX0450_010.nt.2 | 29611 | 29611.0 | Colon array | 299-358
DEX0450_010.nt.2 | 29595 | 29595.0 | Colon array | 299-358
DEX0450_010.nt.2 | 22376 | 22376.0 | Breast array | 288-347


DEX0450_010.nt.2 | 29582 | 29582.0 | Colon array | 288-347
DEX0450_010.nt.2 | 29609 | 29609.0 | Colon array | 292-351
DEX0450_010.nt.2 | 29612 | 29612.0 | Colon array | 262-321
DEX0450_010.nt.3 | 16747 | 16747.0 | Colon array | 2355-2414
DEX0450_010.nt.3 | 29595 | 29595.0 | Colon array | 4246-4305
DEX0450_010.nt.3 | 29612 | 29612.0 | Colon array | 4209-4268
DEX0450_010.nt.3 | 29582 | 29582.0 | Colon array | 4235-4294
DEX0450_010.nt.3 | 29572 | 29572.0 | Colon array | 3762-3821
DEX0450_010.nt.3 | 29611 | 29611.0 | Colon array | 4246-4305
DEX0450_010.nt.3 | 17829 | 17829.0 | Breast array | 3840-3899
DEX0450_010.nt.3 | 29571 | 29571.0 | Colon array | 3832-3891
DEX0450_010.nt.3 | 29609 | 29609.0 | Colon array | 4239-4298
DEX0450_010.nt.3 | 22376 | 22376.0 | Breast array | 4235-4294
DEX0450_011.nt.1 | 22654 | 22654.0 | Colon array | 1967-2026
DEX0450_012.nt.1 | 8377 | 8377.0 | Colon array | 406-465
DEX0450_013.nt.1 | 32221 | 32221.0 | Colon array | 1359-1418
DEX0450_013.nt.1 | 32254 | 32254.0 | Colon array | 390-449
DEX0450_013.nt.1 | 33033 | 33033.0 | Colon array | 635-694
DEX0450_013.nt.1 | 32255 | 32255.0 | Colon array | 266-325
DEX0450_013.nt.1 | 32220 | 32220.0 | Colon array | 1399-1458
DEX0450_014.nt.1 | 32458 | 32458.0 | Colon array | 1846-1905
DEX0450_014.nt.1 | 32459 | 32459.0 | Colon array | 1806-1865
DEX0450_015.nt.1 | 33503 | 33503.0 | Colon array | 7026-7085
DEX0450_016.nt.1 | 35091 | 35091.0 | Colon array | 3392-3451
DEX0450_016.nt.1 | 33083 | 33083.0 | Colon array | 2718-2777
DEX0450_016.nt.2 | 35091 | 35091.0 | Colon array | 2518-2577
DEX0450_016.nt.3 | 33083 | 33083.0 | Colon array | 2835-2894
DEX0450_016.nt.3 | 35091 | 35091.0 | Colon array | 3509-3568
DEX0450_017.nt.1 | 36711 | 36711.0 | Colon array | 484-543
DEX0450_017.nt.1 | 39770 | 39770.0 | Colon array | 486-545
DEX0450_017.nt.1 | 39635 | 39635.0 | Colon array | 652-711
DEX0450_017.nt.1 | 36712 | 36712.0 | Colon array | 342-401
DEX0450_017.nt.1 | 39769 | 39769.0 | Colon array | 526-585
DEX0450_017.nt.1 | 39631 | 39631.0 | Colon array | 651-710
DEX0450_017.nt.1 | 39636 | 39636.0 | Colon array | 612-671
DEX0450_017.nt.2 | 39770 | 39770.0 | Colon array | 266-325
DEX0450_017.nt.2 | 39769 | 39769.0 | Colon array | 306-365
DEX0450_017.nt.2 | 36711 | 36711.0 | Colon array | 264-323
DEX0450_017.nt.2 | 39631 | 39631.0 | Colon array | 431-490
DEX0450_017.nt.2 | 39635 | 39635.0 | Colon array | 432-491
DEX0450_017.nt.3 | 39636 | 39636.0 | Colon array | 479-538
DEX0450_017.nt.3 | 39631 | 39631.0 | Colon array | 518-577
DEX0450_017.nt.3 | 39769 | 39769.0 | Colon array | 393-452
DEX0450_017.nt.3 | 39635 | 39635.0 | Colon array | 519-578
DEX0450_018.nt.1 | 31424 | 31424.0 | Colon array | 2191-2250
DEX0450_018.nt.1 | 31425 | 31425.0 | Colon array | 2120-2179
DEX0450_018.nt.2 | 31424 | 31424.0 | Colon array | 2191-2250
DEX0450_018.nt.3 | 31425 | 31425.0 | Colon array | 2120-2179
DEX0450_018.nt.3 | 31424 | 31424.0 | Colon array | 2191-2250
DEX0450_018.nt.4 | 31425 | 31425.0 | Colon array | 506-565
DEX0450_019.nt.1 | 35571 | 35571.0 | Colon array | 1385-1444
DEX0450_019.nt.1 | 32748 | 32748.0 | Colon array | 752-811
DEX0450_019.nt.1 | 33067 | 33067.0 | Colon array | 1441-1500
DEX0450_019.nt.1 | 35570 | 35570.0 | Colon array | 1437-1496
DEX0450_019.nt.1 | 33066 | 33066.0 | Colon array | 1531-1590
DEX0450_019.nt.2 | 35571 | 35571.0 | Colon array | 1385-1444
DEX0450_019.nt.2 | 33067 | 33067.0 | Colon array | 1441-1500


DEX0450_019.nt.2 | 32748 | 32748.0 | Colon array | 752-811
DEX0450_019.nt.2 | 33066 | 33066.0 | Colon array | 1531-1590
DEX0450_019.nt.2 | 35570 | 35570.0 | Colon array | 1437-1496
DEX0450_019.nt.3 | 32748 | 32748.0 | Colon array | 752-811
DEX0450_019.nt.3 | 33067 | 33067.0 | Colon array | 1441-1500
DEX0450_019.nt.3 | 33066 | 33066.0 | Colon array | 1531-1590
DEX0450_019.nt.3 | 35571 | 35571.0 | Colon array | 1385-1444
DEX0450_020.nt.1 | 32972 | 32972.0 | Colon array | 1059-1118
DEX0450_020.nt.1 | 32991 | 32991.0 | Colon array | 1226-1285
DEX0450_020.nt.1 | 32992 | 32992.0 | Colon array | 1557-1616
DEX0450_020.nt.1 | 32990 | 32990.0 | Colon array | 1246-1305
DEX0450_020.nt.1 | 32995 | 32995.0 | Colon array | 1100-1159
DEX0450_020.nt.1 | 32994 | 32994.0 | Colon array | 1130-1189
DEX0450_020.nt.1 | 35546 | 35546.0 | Colon array | 1557-1616
DEX0450_020.nt.2 | 32991 | 32991.0 | Colon array | 1885-1944
DEX0450_020.nt.2 | 32972 | 32972.0 | Colon array | 1718-1777
DEX0450_020.nt.2 | 32992 | 32992.0 | Colon array | 2216-2275
DEX0450_020.nt.2 | 32995 | 32995.0 | Colon array | 1759-1818
DEX0450_020.nt.2 | 32990 | 32990.0 | Colon array | 1905-1964
DEX0450_020.nt.3 | 32995 | 32995.0 | Colon array | 4462-4521
DEX0450_020.nt.3 | 32992 | 32992.0 | Colon array | 4919-4978
DEX0450_020.nt.3 | 32991 | 32991.0 | Colon array | 4588-4647
DEX0450_020.nt.3 | 32990 | 32990.0 | Colon array | 4608-4667
DEX0450_020.nt.3 | 35546 | 35546.0 | Colon array | 4919-4978
DEX0450_020.nt.3 | 32972 | 32972.0 | Colon array | 4421-4480
DEX0450_021.nt.1 | 10701 | 10701.0 | Colon array | 3600-3659
DEX0450_021.nt.1 | 10744 | 10744.0 | Colon array | 4464-4523
DEX0450_021.nt.1 | 10745 | 10745.0 | Colon array | 4424-4483
DEX0450_021.nt.1 | 10700 | 10700.0 | Colon array | 3685-3744
DEX0450_022.nt.1 | 12058 | 12058.0 | Colon array | 975-1034
DEX0450_022.nt.1 | 12057 | 12057.0 | Colon array | 1271-1330
DEX0450_023.nt.1 | 22060 | 22060.0 | Breast array | 1652-1711
DEX0450_023.nt.1 | 8910 | 8910.0 | Colon array | 1898-1957
DEX0450_023.nt.2 | 22060 | 22060.0 | Breast array | 776-835
DEX0450_023.nt.3 | 8910 | 8910.0 | Colon array | 381-440
DEX0450_024.nt.1 | 36564 | 36564.0 | Colon array | 315-374
DEX0450_025.nt.1 | 37705 | 37705.0 | Colon array | 273-332
DEX0450_025.nt.1 | 77759 | 77759.0 | Multi-Cancer array | 46-105
DEX0450_025.nt.1 | 37706 | 37706.0 | Colon array | 233-292
DEX0450_027.nt.1 | 35471 | 35471.0 | Colon array | 2662-2721
DEX0450_027.nt.1 | 35470 | 35470.0 | Colon array | 2787-2846
DEX0450_027.nt.2 | 35471 | 35471.0 | Colon array | 2708-2767
DEX0450_028.nt.1 | 31539 | 31539.0 | Colon array | 715-774
DEX0450_028.nt.1 | 31545 | 31545.0 | Colon array | 938-997
DEX0450_029.nt.1 | 37947 | 37947.02 | Ovarian array | 1100-1159
DEX0450_030.nt.1 | 8410 | 8410.0 | Colon array | 174-233
DEX0450_030.nt.1 | 8411 | 8411.0 | Colon array | 144-203
DEX0450_031.nt.1 | 957 | 957.0 | Lung array | 1843-1902
DEX0450_031.nt.1 | 28423 | 28423.0 | Colon array | 284-343
DEX0450_031.nt.1 | 956 | 956.0 | Multi-Cancer array | 280-339
DEX0450_031.nt.1 | 17869 | 17869.0 | Breast array | 279-338
DEX0450_031.nt.1 | 9880 | 9880.01 | Ovarian array | 284-343
DEX0450_031.nt.1 | 31466 | 31466.0 | Breast array | 648-707
DEX0450_031.nt.1 | 955 | 955.0 | Lung array | 285-344
DEX0450_031.nt.1 | 38723 | 38723.01 | Prostate2 array | 285-344
DEX0450_031.nt.1 | 5750 | 5750.0 | Lung array | 1882-1941
DEX0450_031.nt.1 | 835 | 835.0 | Lung array | 638-697


DEX0450_031.nt.1 | 21356 | 21356.0 | Colon array | 2168-2227
DEX0450_031.nt.1 | 5254 | 5254.0 | Lung array | 2168-2227
DEX0450_031.nt.1 | 836 | 836.0 | Lung array | 618-677
DEX0450_031.nt.1 | 5749 | 5749.0 | Lung array | 1767-1826
DEX0450_031.nt.1 | 953 | 953.0 | Lung array | 478-537
DEX0450_031.nt.2 | 5749 | 5749.0 | Lung array | 442-501
DEX0450_031.nt.2 | 5750 | 5750.0 | Lung array | 557-616
DEX0450_031.nt.2 | 957 | 957.0 | Lung array | 518-577
DEX0450_031.nt.2 | 5254 | 5254.0 | Lung array | 843-902
DEX0450_031.nt.2 | 21356 | 21356.0 | Colon array | 843-902
DEX0450_032.nt.1 | 35945 | 35945.03 | Prostate2 array | 851-910
DEX0450_032.nt.1 | 15848 | 15848.0 | Breast array | 942-1001
DEX0450_032.nt.1 | 35955 | 35955.02 | Prostate2 array | 942-1001
DEX0450_032.nt.1 | 32036 | 32036.02 | Prostate1 array | 942-1001
DEX0450_032.nt.1 | 19576 | 19576.0 | Breast array | 942-1001
DEX0450_032.nt.1 | 16912 | 16912.0 | Breast array | 851-910
DEX0450_032.nt.1 | 35907 | 35907.03 | Prostate2 array | 851-910
DEX0450_032.nt.1 | 34888 | 34888.0 | Colon array | 965-1024
DEX0450_032.nt.1 | 16908 | 16908.0 | Breast array | 942-1001
DEX0450_032.nt.2 | 16912 | 16912.0 | Breast array | 5212-5271
DEX0450_032.nt.2 | 16111 | 16111.0 | Breast array | 4379-4438
DEX0450_032.nt.2 | 35897 | 35897.03 | Prostate2 array | 3910-3969
DEX0450_032.nt.2 | 35959 | 35959.02 | Prostate2 array | 4379-4438
DEX0450_032.nt.2 | 15320 | 15320.0 | Breast array | 3570-3629
DEX0450_032.nt.2 | 31256 | 31256.03 | Prostate2 array | 3565-3624
DEX0450_032.nt.2 | 35907 | 35907.03 | Prostate2 array | 5212-5271
DEX0450_032.nt.2 | 15865 | 15865.0 | Breast array | 3565-3624
DEX0450_032.nt.2 | 16908 | 16908.0 | Breast array | 5303-5362
DEX0450_032.nt.2 | 15848 | 15848.0 | Breast array | 5303-5362
DEX0450_032.nt.2 | 32036 | 32036.02 | Prostate1 array | 5303-5362
DEX0450_032.nt.2 | 15317 | 15317.0 | Breast array | 5036-5095
DEX0450_032.nt.2 | 15315 | 15315.0 | Breast array | 4823-4882
DEX0450_032.nt.2 | 35955 | 35955.02 | Prostate2 array | 5303-5362
DEX0450_032.nt.2 | 16910 | 16910.0 | Breast array | 4379-4438
DEX0450_032.nt.2 | 35945 | 35945.03 | Prostate2 array | 5212-5271
DEX0450_032.nt.2 | 32020 | 32020.02 | Prostate1 array | 1062-1121
DEX0450_032.nt.2 | 35957 | 35957.03 | Prostate2 array | 4379-4438
DEX0450_032.nt.2 | 15324 | 15324.0 | Breast array | 4113-4172
DEX0450_032.nt.2 | 19576 | 19576.0 | Breast array | 5303-5362
DEX0450_032.nt.2 | 35919 | 35919.02 | Prostate2 array | 4113-4172
DEX0450_032.nt.2 | 34888 | 34888.0 | Colon array | 5326-5385
DEX0450_032.nt.2 | 35905 | 35905.03 | Prostate2 array | 4833-4892
DEX0450_032.nt.2 | 26953 | 26953.02 | Prostate1 array | 3763-3822
DEX0450_032.nt.2 | 20237 | 20237.0 | Breast array | 1062-1121
DEX0450_032.nt.2 | 31591 | 31591.0 | Breast array | 3572-3631
DEX0450_032.nt.2 | 35951 | 35951.01 | Prostate2 array | 3574-3633
DEX0450_032.nt.2 | 15830 | 15830.0 | Breast array | 3880-3939
DEX0450_032.nt.2 | 15829 | 15829.0 | Breast array | 3910-3969
DEX0450_032.nt.2 | 16906 | 16906.0 | Breast array | 3574-3633
DEX0450_032.nt.2 | 31590 | 31590.0 | Breast array | 3763-3822
DEX0450_032.nt.2 | 35901 | 35901.01 | Prostate2 array | 3570-3629
DEX0450_032.nt.3 | 35919 | 35919.02 | Prostate2 array | 4897-4956
DEX0450_032.nt.3 | 16111 | 16111.0 | Breast array | 5163-5222
DEX0450_032.nt.3 | 31256 | 31256.03 | Prostate2 array | 4349-4408
DEX0450_032.nt.3 | 15848 | 15848.0 | Breast array | 6087-6146
DEX0450_032.nt.3 | 32036 | 32036.02 | Prostate1 array | 6087-6146
DEX0450_032.nt.3 | 26953 | 26953.02 | Prostate1 array | 4547-4606


DEX0450 032. nt. 3 


35955 


35955.02 


Prostate2 array 


6087-6146 



•i 
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DEX0450 032. nt. 3 


15315 


15315.0 


Breast array 


5607-5666 


DEX0450 032. nt. 3 


35951 


35951.01 


Prostate2 array 


4358-4417 


DEX0450 032. nt. 3 


15830 


15830.0 


Breast array 


4664-4723 


DEX0450 032. nt. 3 


35901 


35901.01 


Prostate2 array 


4354-4413 


DEX0450 032. nt. 3 


35897 


35897.03 


Prostate2 array 


4694-4753 


DEX0450 032. nt. 3 


35957 


35957.03 


Prostate2 array 


5163-5222 


DEX0450 032. nt. 3 


35945 


35945.03 


Prostate2 array 


5996-6055 


DEX0450 032. nt. 3 


32020 


32020.02 


Prostate1 array


1062-1121 


DEX0450 032. nt. 3 


35959 


35959.02


Prostate2 array 


5163-5222 


DEX0450 032. nt. 3 


15324 


15324.0 


Breast array 


4897-4956 


DEX0450 032. nt. 3 


19576 


19576.0 


Breast array 


6087-6146 


DEX0450 032. nt. 3 


15320 


15320.0 


Breast array 


4354-4413 


DEX0450 032. nt. 3 


16908 


16908.0


Breast array 


6087-6146 


DEX0450 032. nt. 3 


34888 


34888.0 


Colon array 


6110-6169 


DEX0450 032. nt. 3 


15317 


15317.0 


Breast array 


5820-5879 


DEX0450 032. nt. 3 


35905 


35905.03


Prostate2 array 


5617-5676 


DEX0450 032. nt. 3 


35907 


35907.03 


Prostate2 array 


5996-6055 


DEX0450 032. nt. 3 


15865 


15865.0 


Breast array 


4349-4408 


DEX0450 032. nt. 3 


20237 


20237.0 


Breast array 


1062-1121 


DEX0450 032. nt. 3 


31591 


31591.0 


Breast array 


4356-4415 


DEX0450 032. nt. 3 


15829 


15829.0 


Breast array 


4694-4753 


DEX0450 032. nt. 3 


16910 


16910.0 


Breast array 


5163-5222 


DEX0450 032. nt. 3 


16906 


16906.0 


Breast array 


4358-4417 


DEX0450 032.nt.3 


31590 


31590.0 


Breast array 


4547-4606 


DEX0450 032. nt. 3 


16912 


16912.0 


Breast array 


5996-6055 


DEX0450 032. nt. 4 


35897 


35897.03 


Prostate2 array 


1668-1727 


DEX0450 032. nt. 4 


15320 


15320.0 


Breast array 


1328-1387 


DEX0450 032. nt. 4 


16111 


16111.0 


Breast array 


2137-2196 


DEX0450 032. nt. 4 


35959 


35959.02 


Prostate2 array 


2137-2196 


DEX0450 032. nt. 4 


16908 


16908.0 


Breast array 


3061-3120 


DEX0450 032. nt. 4 


31256 


31256.03 


Prostate2 array 


1323-1382 


DEX0450 032. nt. 4 


15848 


15848.0 


Breast array 


3061-3120 


DEX0450 032. nt. 4 


32036 


32036.02 


Prostate1 array


3061-3120 


DEX0450 032. nt. 4 


15865 


15865.0 


Breast array 


1323-1382 


DEX0450 032. nt. 4 


35907 


35907.03 


Prostate2 array 


2970-3029 


DEX0450 032. nt. 4 


15317 


15317.0 


Breast array 


2794-2853 


DEX0450 032. nt. 4 


35955 


35955.02 


Prostate2 array 


3061-3120 


DEX0450 032. nt. 4 


16910 


16910.0 


Breast array 


2137-2196 


DEX0450 032. nt. 4 


15315 


15315.0 


Breast array 


2581-2640 


DEX0450 032. nt. 4 


35945 


35945.03 


Prostate2 array 


2970-3029 


DEX0450 032. nt. 4 


35957 


35957.03 


Prostate2 array 


2137-2196 


DEX0450 032. nt. 4


32020 


32020.02 


Prostate1 array


1062-1121 


DEX0450 032. nt. 4 


15324 


15324.0 


Breast array 


1871-1930 


DEX0450 032. nt. 4 


35919 


35919.02 


Prostate2 array 


1871-1930 


DEX0450 032. nt. 4 


19576 


19576.0 


Breast array 


3061-3120 


DEX0450 032. nt. 4 


34888 


34888.0 


Colon array 


3084-3143 


DEX0450 032. nt. 4 


35905 


35905.03 


Prostate2 array 


2591-2650 


DEX0450 032. nt. 4 


20237 


20237.0 


Breast array 


1062-1121 


DEX0450 032. nt. 4 


31591 


31591.0 


Breast array 


1330-1389 


DEX0450 032. nt. 4 


26953 


26953.02 


Prostate1 array


1521-1580 


DEX0450 032. nt. 4 


15829 


15829.0 


Breast array 


1668-1727 


DEX0450 032. nt. 4 


15830 


15830.0 


Breast array 


1638-1697 


DEX0450 032. nt. 4 


31590 


31590.0 


Breast array 


1521-1580 


DEX0450 032. nt. 4 


16906 


16906.0 


Breast array 


1332-1391 


DEX0450 032. nt. 4 


35951 


35951.01 


Prostate2 array 


1332-1391 


DEX0450 032. nt. 4 


35901 


35901.01 


Prostate2 array 


1328-1387 


DEX0450 032. nt. 5 


35919 


35919.02 


Prostate2 array 


4572-4631


DEX0450 032. nt. 5


16111 


16111.0 


Breast array 


4838-4897 


DEX0450 032. nt. 5 


20237 


20237.0 


Breast array 


1062-1121 


DEX0450 032. nt. 5 


31256 


31256.03 


Prostate2 array 


4024-4083 


DEX0450 032. nt. 5 


35901 


35901.01 


Prostate2 array 


4029-4088 


DEX0450 032. nt. 5 


35957 


35957.03 


Prostate2 array 


4838-4897 


DEX0450 032. nt. 5 


15848 


15848.0 


Breast array 


5762-5821 


DEX0450 032. nt. 5 


15830 


15830.0 


Breast array 


4339-4398 


DEX0450 032. nt. 5 


15315 


15315.0 


Breast array 


5282-5341 


DEX0450 032. nt. 5 


26953 


26953.02 


Prostate1 array


4222-4281 


DEX0450 032. nt. 5 


35955 


35955.02 


Prostate2 array 


5762-5821 


DEX0450 032. nt. 5 


35951 


35951.01 


Prostate2 array 


4033-4092 


DEX0450 032. nt. 5 


32036 


32036.02 


Prostate1 array


5762-5821 


DEX0450 032. nt. 5 


15324 


15324.0


Breast array 


4572-4631 


DEX0450 032. nt. 5 


19576 


19576.0 


Breast array 


5762-5821 


DEX0450 032. nt. 5 


34888 


34888.0 


Colon array 


5785-5844 


DEX0450 032. nt. 5 


15320 


15320.0 


Breast array 


4029-4088 


DEX0450 032. nt. 5 


32020 


32020.02 


Prostate1 array


1062-1121 


DEX0450 032. nt. 5 


35959 


35959.02 


Prostate2 array 


4838-4897 


DEX0450 032. nt. 5 


16908 


16908.0 


Breast array 


5762-5821 


DEX0450 032. nt. 5 


35945 


35945.03 


Prostate2 array 


5671-5730 


DEX0450 032. nt. 5


35897 


35897.03 


Prostate2 array 


4369-4428 


DEX0450 032. nt. 5 


31590 


31590.0 


Breast array 


4222-4281 


DEX0450 032. nt. 5 


15865 


15865.0 


Breast array 


4024-4083 


DEX0450 032. nt. 5 


15317 


15317.0 


Breast array 


5495-5554 


DEX0450 032. nt. 5 


31591 


31591.0 


Breast array 


4031-4090 


DEX0450 032. nt. 5 


16910 


16910.0 


Breast array 


4838-4897 


DEX0450 032. nt. 5 


35905 


35905.03 


Prostate2 array 


5292-5351 


DEX0450 032. nt. 5 


16912 


16912.0 


Breast array 


5671-5730 


DEX0450 032. nt. 5 


16906 


16906.0 


Breast array 


4033-4092 


DEX0450 032. nt. 5 


15829 


15829.0 


Breast array 


4369-4428 


DEX0450 032. nt. 5


35907 


35907.03 


Prostate2 array


5671-5730 


DEX0450 032. nt. 6 


16910 


16910.0 


Breast array 


4838-4897 


DEX0450 032. nt. 6 


31256 


31256.03 


Prostate2 array 


4024-4083 


DEX0450 032. nt. 6 


16908 


16908.0 


Breast array 


5582-5641 


DEX0450 032. nt. 6 


15324 


15324.0


Breast array 


4572-4631 


DEX0450 032. nt. 6 


35901 


35901.01 


Prostate2 array 


4029-4088 


DEX0450 032. nt. 6 


15315 


15315.0 


Breast array 


5282-5341 


DEX0450 032. nt. 6


35957 


35957.03 


Prostate2 array 


4838-4897 


DEX0450 032. nt. 6 


16912 


16912.0 


Breast array 


5491-5550 


DEX0450 032. nt. 6 


16111 


16111.0 


Breast array 


4838-4897 


DEX0450 032. nt. 6


15865 


15865.0 


Breast array 


4024-4083 


DEX0450 032. nt. 6 


34888 


34888.0 


Colon array 


5605-5664 


DEX0450 032. nt. 6 


15320 


15320.0 


Breast array 


4029-4088 


DEX0450 032. nt. 6


35919 


35919.02 


Prostate2 array 


4572-4631 


DEX0450 032. nt. 6 


26953 


26953.02 


Prostate1 array


4222-4281 


DEX0450 032. nt. 6 


20237 


20237.0 


Breast array 


1062-1121 


DEX0450 032. nt. 6 


35897 


35897.03 


Prostate2 array 


4369-4428 


DEX0450 032. nt. 6 


35945 


35945.03 


Prostate2 array 


5491-5550 


DEX0450 032. nt. 6 


15829 


15829.0 


Breast array 


4369-4428 


DEX0450 032. nt. 6 


32036 


32036.02 


Prostate1 array


5582-5641 


DEX0450 032. nt. 6 


35955 


35955.02 


Prostate2 array 


5582-5641 


DEX0450 032. nt. 6 


32020 


32020.02 


Prostate1 array


1062-1121 


DEX0450 032. nt. 6 


15830 


15830.0 


Breast array 


4339-4398 


DEX0450 032. nt. 6 


35959 


35959.02 


Prostate2 array 


4838-4897 


DEX0450 032. nt. 6 


35905 


35905.03 


Prostate2 array 


5292-5351 


DEX0450 032. nt. 6 


35951 


35951.01 


Prostate2 array 


4033-4092 


DEX0450 032. nt. 6 


19576 


19576.0 


Breast array 


5582-5641


DEX0450 032. nt. 6


15848 


15848.0 


Breast array 


5582-5641 


DEX0450 032. nt. 6 


31591 


31591.0 


Breast array


4031-4090 


DEX0450 032. nt. 6 


31590 


31590.0 


Breast array 


4222-4281 


DEX0450 032. nt. 6 


16906 


16906.0 


Breast array 


4033-4092 


DEX0450 032. nt. 6 


35907 


35907.03 


Prostate2 array 


5491-5550 


DEX0450 032. nt. 7 


32036 


32036.02 


Prostate1 array


5278-5337 


DEX0450 032. nt. 7 


16912 


16912.0 


Breast array 


5187-5246 


DEX0450 032. nt. 7 


31591 


31591.0 


Breast array 


4031-4090 


DEX0450 032. nt. 7 


19576 


19576.0 


Breast array 


5278-5337 


DEX0450 032. nt. 7 


35919 


35919.02 


Prostate2 array 


4323-4382 


DEX0450 032. nt. 7 


15865 


15865.0 


Breast array 


4024-4083 


DEX0450 032. nt. 7 


35945 


35945.03


Prostate2 array 


5187-5246 


DEX0450 032. nt. 7 


35907 


35907.03 


Prostate2 array 


5187-5246 


DEX0450 032.nt.7 


31590 


31590.0 


Breast array 


4222-4281 


DEX0450 032. nt. 7 


15315 


15315.0 


Breast array 


4798-4857 


DEX0450 032. nt. 7 


15848 


15848.0 


Breast array 


5278-5337 


DEX0450 032. nt. 7 


35955 


35955.02 


Prostate2 array 


5278-5337 


DEX0450 032. nt. 7 


26953 


26953.02 


Prostate1 array


4222-4281 


DEX0450 032. nt. 7 


15317 


15317.0 


Breast array 


5011-5070 


DEX0450 032. nt. 7 


15320 


15320.0 


Breast array 


4029-4088 


DEX0450 032. nt. 7 


32020 


32020.02 


Prostate1 array


1062-1121 


DEX0450 032. nt. 7 


20237 


20237.0 


Breast array 


1062-1121 


DEX0450 032. nt. 7 


15324 


15324.0 


Breast array 


4323-4382 


DEX0450 032. nt. 7 


31256 


31256.03 


Prostate2 array 


4024-4083 


DEX0450 032. nt. 7 


35905 


35905.03 


Prostate2 array 


4808-4867 


DEX0450 032. nt. 7 


35901 


35901.01 


Prostate2 array 


4029-4088 


DEX0450 032. nt. 7 


16908 


16908.0 


Breast array 


5278-5337 


DEX0450 032. nt. 7 


35951 


35951.01 


Prostate2 array 


4033-4092 


DEX0450 032. nt. 7 


34888 


34888.0 


Colon array 


5301-5360 


DEX0450 032. nt. 7 


16906 


16906.0 


Breast array 


4033-4092 


DEX0450 033.nt.l 


1234 


1234.0 


Lung array 


528-587 


DEX0450 033.nt.l 


18480 


18480.02 


Ovarian array 


528-587 


DEX0450 033.nt.l 


30532 


30532.0 


Colon array 


526-585 


DEX0450 033. nt. 2 


34642 


34642.0 


Colon array 


1335-1394 


DEX0450 033. nt. 2 


30532 


30532.0 


Colon array 


525-584 


DEX0450 033. nt. 2 


1234 


1234.0 


Lung array 


527-586 


DEX0450 033. nt. 2 


24954 


24954.02 


Prostate1 array


1306-1365 


DEX0450 033. nt. 2 


34643 


34643.0


Colon array 


1295-1354 


DEX0450 033. nt. 2 


18480 


18480.02 


Ovarian array 


527-586 


DEX0450 033. nt. 2 


5421 


5421.0 


Lung array 


1233-1292 


DEX0450 033. nt. 2 


28019 


28019.02 


Prostate1 array


1335-1394 


DEX0450 034.nt.l 


37498 


37498.0 


Colon array 


409-468


DEX0450 035.nt.l 


20153 


20153.0 


Colon array 


1276-1335 


DEX0450 035.nt.l 


20154 


20154.0 


Colon array 


1230-1289 


DEX0450 036.nt.l 


5961 


5961.0 


Lung array 


1321-1380 


DEX0450 036.nt.l 


37616 


37616.0 


Colon array 


1951-2010 


DEX0450 036.nt.l 


37635 


37635.0 


Colon array 


1975-2034 


DEX0450 036.nt.l 


37615 


37615.0 


Colon array 


1991-2050 


DEX0450 037.nt.l 


1022 


1022.0 


Lung array 


480-539 


DEX0450 037.nt.l 


12335 


12335.02 


Ovarian array 


374-433 


DEX0450 037.nt.1


33741


33741.0


Colon array 


408-467 


DEX0450 037.nt.l 


29069 


29069.01 


Prostate2 array 


374-433 


DEX0450 037.nt.l 


1023 


1023.0 


Lung array 


425-484 


DEX0450 037.nt.l 


36689 


36689.01 


Prostate1 array


547-606 


DEX0450 037.nt.l 


29065 


29065.03 


Prostate2 array 


480-539 


DEX0450 038.nt.l 


38976 


38976.0 


Colon array 


457-516 


DEX0450 038.nt.l 


38975 


38975.0 


Colon array 


483-542


DEX0450 039.nt.1


38502 


38502.0


Colon array 


457-516


DEX0450 039.nt.l 


38501 


38501.0 


Colon array


558-617 


DEX0450 039. nt. 2


38499 


38499.0 


Colon array


1002-1061 


DEX0450 039. nt. 2 


36599 


36599.0 


Colon array 


779-838 


DEX0450 039.nt.2


38501 


38501.0 


Colon array 


559-618 


DEX0450 039. nt. 2 


38502 


38502.0 


Colon array 


458-517 


DEX0450 039. nt. 2 


38509 


38509.0 


Colon array 


996-1055 


DEX0450 039.nt.2


38510 


38510.0 


Colon array 


799-858 


DEX0450 039.nt.2


38428 


38428.0


Colon array 


1001-1060 


DEX0450 039. nt. 3 


38499 


38499.0 


Colon array 


878-937 


DEX0450 039.nt.3


38510 


38510.0 


Colon array 


675-734 


DEX0450 039. nt. 3 


36599 


36599.0 


Colon array 


655-714 


DEX0450 039. nt. 3 


38509 


38509.0


Colon array 


872-931 


DEX0450 039. nt. 3 


38428 


38428.0 


Colon array 


877-936 


DEX0450 039. nt. 3 


38501 


38501.0 


Colon array 


435-494 


DEX0450 039. nt. 4 


38510 


38510.0 


Colon array 


1091-1150 


DEX0450 039. nt. 4 


36599 


36599.0 


Colon array 


1071-1130 


DEX0450 039.nt.4


38501 


38501.0 


Colon array 


793-852 


DEX0450 039. nt. 4 


38502 


38502.0 


Colon array 


692-751 


DEX0450 039.nt.4 


38428 


38428.0 


Colon array 


1293-1352 


DEX0450 039.nt.4


38509 


38509.0 


Colon array 


1288-1347 


DEX0450 039.nt.4


38499 


38499.0 


Colon array 


1294-1353 


DEX0450 039. nt. 5 


38501 


38501.0 


Colon array 


793-852 


DEX0450 039. nt. 5 


38502 


38502.0 


Colon array 


692-751 


DEX0450 039. nt. 6 


38501 


38501.0 


Colon array 


793-852


DEX0450 040.nt.l 


39582 


39582.0 


Colon array 


545-604 


DEX0450 040.nt.1


35521 


35521.0 


Colon array 


498-557 


DEX0450 040.nt.l 


39581 


39581.0 


Colon array 


655-714 


DEX0450 040.nt.1


35520 


35520.0 


Colon array 


655-714 


DEX0450 041.nt.l 


36756 


36756.0


Colon array 


110-169 


DEX0450 041.nt.l 


33072 


33072.0 


Colon array 


470-529 


DEX0450 041.nt.l 


33075 


33075.0 


Colon array 


979-1038 


DEX0450 041.nt.l 


29313 


29313.03


Prostate2 array 


1076-1135 


DEX0450 041.nt.l 


36755 


36755.0 


Colon array 


130-189 


DEX0450 041.nt.l 


37290 


37290.0 


Colon array 


1167-1226 


DEX0450 041.nt.l 


33074 


33074.0


Colon array 


1076-1135 


DEX0450 041.nt.l 


29311 


29311.03 


Prostate2 array 


470-529 


DEX0450 042.nt.l 


33118 


33118.0 


Colon array 


3762-3821 


DEX0450 042.nt.l 


40737 


40737.0 


Colon array 


2151-2210 


DEX0450 042.nt.l 


34009 


34009.0 


Colon array 


3065-3124 


DEX0450 042.nt.l 


33119 


33119.0 


Colon array 


3722-3781 


DEX0450 042. nt. 2 


33118 


33118.0 


Colon array 


2347-2406 


DEX0450 042. nt. 2 


34009 


34009.0 


Colon array 


1650-1709 


DEX0450 042. nt. 2 


33119 


33119.0 


Colon array 


2307-2366 


DEX0450 042. nt. 2 


40737 


40737.0 


Colon array 


736-795 


DEX0450 043.nt.l 


35881 


35881.0 


Colon array 


2681-2740 


DEX0450 043.nt.l 


36670 


36670.0 


Colon array 


6794-6853 


DEX0450 043.nt.l 


35882 


35882.0 


Colon array 


2641-2700 


DEX0450 043.nt.l 


36669 


36669.0 


Colon array 


6919-6978 


DEX0450 044.nt.l 


24211 


24211.0 


Breast array 


173-232 


DEX0450 044.nt.l 


9110 


9110.0 


Colon array 


206-265 


DEX0450 044.nt.l 


24210 


24210.0 


Breast array 


206-265 


DEX0450 044. nt. 2 


9110 


9110.0 


Colon array 


1624-1683 


DEX0450 044. nt. 2 


24211 


24211.0 


Breast array 


1591-1650 


DEX0450 044. nt. 2 


24210 


24210.0 


Breast array 


1624-1683 


DEX0450 044. nt. 3 


9110 


9110.0 


Colon array 


794-853 


DEX0450 044. nt. 3 


24210 


24210.0 


Breast array 


794-853


DEX0450 044. nt. 3


24211 


24211.0 


Breast array 


761-820 


DEX0450 045.nt.l 


5599 


5599.0 


Lung array 


873-932 


DEX0450 045.nt.l 


19854 


19854.0 


Breast array 


785-844 


DEX0450 045.nt.l 


13784 


13784.0


Colon array 


765-824 


DEX0450 045.nt.l 


19853 


19853.0


Breast array 


805-864 


DEX0450 045.nt.l 


13783 


13783.0 


Colon array 


805-864 


DEX0450 046.nt.l 


10297 


10297.0 


Colon array 


353-412 


DEX0450_047.nt.l 


38789 


38789.0 


Colon array 


1517-1576 


DEX0450 047.nt.l 


30511 


30511.0 


Colon array 


611-670 


DEX0450 047.nt.l 


39412 


39412.0 


Colon array 


1515-1574 


DEX0450 048.nt.l 


41177 


41177.0 


Colon array 


549-608 


DEX0450 048.nt.l 


41178 


41178.0 


Colon array 


519-578 


DEX0450 049.nt.l 


31424 


31424.0


Colon array 


603-662 


DEX0450 049.nt.l 


31425 


31425.0 


Colon array 


532-591 


DEX0450 050.nt.l 


9398 


9398.0 


Colon array 


808-867 


DEX0450 050.nt.1


9399 


9399.0


Colon array 


764-823 


DEX0450 052.nt.l 


37943 


37943.0


Colon array 


522-581 


DEX0450 052.nt.l 


37944 


37944.0


Colon array 


402-461 


DEX0450 053.nt.l 


9398 


9398.0 


Colon array 


809-868 


DEX0450 053.nt.1


9399 


9399.0 


Colon array 


765-824 


DEX0450 053.nt.l 


31645 


31645.0 


Breast array 


1875-1934 


DEX0450 053.nt.1


31644 


31644.0 


Breast array 


1945-2004 



Example 2b: Relative Quantitation of Gene Expression 

Real-time quantitative PCR with fluorescent Taqman® probes is a quantitative
detection system utilizing the 5'-3' nuclease activity of Taq DNA polymerase. The
method uses an internal fluorescent oligonucleotide probe (Taqman®) labeled with a 5'
reporter dye and a downstream, 3' quencher dye. During PCR, the 5'-3' nuclease activity
of Taq DNA polymerase releases the reporter, whose fluorescence can then be detected by
the laser detector of the Model 7700 Sequence Detection System (PE Applied Biosystems,
Foster City, CA, USA). Amplification of an endogenous control is used to standardize the
amount of sample RNA added to the reaction and to normalize for reverse transcriptase
(RT) efficiency. Either cyclophilin, glyceraldehyde-3-phosphate dehydrogenase
(GAPDH), ATPase, or 18S ribosomal RNA (rRNA) is used as this endogenous control.
To calculate relative quantitation between all the samples studied, the target RNA levels
for one sample are used as the basis for comparative results (the calibrator). Quantitation
relative to the calibrator can be obtained using the comparative method (User Bulletin
#2: ABI PRISM 7700 Sequence Detection System).
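
The comparative method is commonly written as the 2^(-ddCt) calculation, in which the threshold cycle (Ct) of the target is first normalized to the endogenous control and then to the calibrator sample. The following is a minimal sketch of that arithmetic, assuming approximately equal amplification efficiencies for the target and the endogenous control. The function name, parameter names, and Ct values are hypothetical illustrations and are not data from this application.

```python
def relative_quantity(ct_target, ct_control, cal_ct_target, cal_ct_control):
    """Relative expression of a sample versus the calibrator (2^-ddCt).

    ct_target / ct_control: threshold cycles of the target gene and the
    endogenous control (e.g. GAPDH or 18S rRNA) in the test sample.
    cal_ct_target / cal_ct_control: the same two values in the calibrator.
    Assumes roughly equal amplification efficiency for both amplicons.
    """
    # Normalize each sample to its own endogenous control.
    delta_ct_sample = ct_target - ct_control
    delta_ct_calibrator = cal_ct_target - cal_ct_control
    # Express the sample relative to the calibrator.
    delta_delta_ct = delta_ct_sample - delta_ct_calibrator
    return 2.0 ** (-delta_delta_ct)


# Hypothetical Ct values, for illustration only.
calibrator_level = relative_quantity(30.0, 20.0, 30.0, 20.0)
sample_level = relative_quantity(26.0, 20.0, 30.0, 20.0)
print(calibrator_level, sample_level)
```

By construction the calibrator itself has a relative level of 1, so every other value reads directly as a fold difference from the calibrator.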

The tissue distribution and the level of the target gene are evaluated for every
sample in normal and cancer tissues. Total RNA is extracted from normal tissues, cancer
tissues, and from cancers and the corresponding matched adjacent tissues. Subsequently,
first strand cDNA is prepared with reverse transcriptase and the polymerase chain reaction
is performed using primers and Taqman® probes specific to each target gene. The results are
analyzed using the ABI PRISM 7700 Sequence Detector. The absolute numbers reported are
relative levels of expression of the target gene in a particular tissue compared to the
calibrator tissue.

One of ordinary skill can design appropriate primers. The relative levels of
expression of the CSNA versus normal tissues and other cancer tissues can then be
determined. All the values are compared to the calibrator. Normal RNA samples are
commercially available pools, obtained by pooling samples of a particular tissue from
different individuals.

The relative levels of expression of the CSNA in pairs of matched samples may
also be determined. A matched pair is formed by mRNA from the cancer sample for a
particular tissue and mRNA from the normal adjacent sample for that same tissue from the
same individual. All the values are compared to the calibrator.

In the analysis of matching samples, the CSNAs show a high degree of tissue
specificity for the tissue of interest. These results confirm the tissue specificity results
obtained with normal pooled samples. Further, the levels of mRNA expression in cancer
samples and in the isogenic normal adjacent tissue from the same individual are compared.
This comparison provides an indication of specificity for the cancer state (e.g. higher
levels of mRNA expression in the cancer sample compared to the normal adjacent tissue).
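
The matched-pair comparison described above can be sketched as a per-individual ratio of the cancer (CAN) level to the normal adjacent tissue (NAT) level, both already expressed relative to the calibrator. This is an illustrative sketch only: the function name, the pair labels, the numeric values, and the two-fold cutoff are all assumptions introduced here and do not come from this application.

```python
def cancer_vs_nat(pairs, min_fold=2.0):
    """Compare cancer and normal adjacent tissue levels for matched pairs.

    pairs: dict mapping a pair label to (cancer_level, nat_level), where
    both levels are relative to the same calibrator.
    min_fold: hypothetical cutoff for calling expression "higher in cancer".
    Returns a dict mapping each label to (fold_change, is_higher_in_cancer).
    """
    results = {}
    for label, (cancer_level, nat_level) in pairs.items():
        if nat_level == 0:
            # No detectable expression in NAT: the ratio is unbounded.
            fold = float("inf") if cancer_level > 0 else 1.0
        else:
            fold = cancer_level / nat_level
        results[label] = (fold, fold >= min_fold)
    return results


# Hypothetical relative expression levels, for illustration only.
example_pairs = {
    "pair_1": (8.0, 2.0),
    "pair_2": (1.0, 4.0),
    "pair_3": (3.0, 0.0),
}
for label, (fold, higher) in cancer_vs_nat(example_pairs).items():
    print(label, fold, higher)
```

Because each ratio is taken within one individual, differences between individuals in baseline expression cancel out of the comparison.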

Information on the samples tested in the QPCR experiments below is given in the
following table and includes the Sample ID (Smpl ID), Tissue, Tissue Type (Tiss Type),
Diagnosis (DIAG), Disease Detail, and Stage or Grade (STG or GRD).



Sample ID


Tissue 


TISS 
TYPE 


DIAG 


DISEASE DETAIL 


STG or GRD 


AS12


Colon 


CAN 




T 


Stage B


AS12 


Colon 


NAT 




NL 




AS46 


Colon 


CAN 




malignant 


T3N1MX 


AS46 


Colon 


NAT 




NAT 




B34 


Colon 


CAN 


Adenocarcinoma 






B34 


Colon 


NAT 




NAT 




C9XR 


Colon 


CAN 




Rectum Cancer 


Stage D 


C9XR 


Colon 


NAT 




NAT 




CM67 


Colon 


CAN 


Adenocarcinoma 


Adenocarcinoma 
of cecum, 
Moderately- 
differentiated 


Stage II 


CM67 


Colon 


NAT 




NAT 




TX89 


Colon 


CAN 


Adenocarcinoma 


Adenocarcinoma 
of Transverse 
Colon 


Stage IV


TX89 


Colon 


NAT 




NAT




AS43


Colon 


CAN 


Adenocarcinoma 


malignant 




AS43 


Colon 


NAT 


Adenocarcinoma


NAT 




AS98 


Colon 


CAN 


Adenocarcinoma 


Moderately to 
poorly 

differentiated 
adenocarcinoma 


Duke's C 


AS98 


Colon 


NAT 




NAT 




RS53 


Colon 


CAN 


Adenocarcinoma 


moderately 

differentiated 

adenocarcinoma 




RS53 


Colon 


NAT 


Adenocarcinoma 


NAT 




RC01 


Colon 


CAN 


Cancer 




Stage IV 


RC01 


Colon 


NAT 




NAT 




SG27 


Colon 


CAN 




malig 


Stage B 


SG27 


Colon 


NAT 




NAT 




DC19 


Colon 


CAN 




T 


Stage B 


DC19 


Colon 


NAT 




NL 




401C 


Colon 


CAN 


Adenocarcinoma 


Adenocarcinoma 
of ascending 
colon and cecum 


Stage III 


401C 


Colon 


NAT 




NAT 




CM12 


Colon 


CAN 




T 


Stage D 


CM12 


Colon 


NAT 


Adenocarcinoma


NAT




TX01 


Colon 


CAN 


Adenocarcinoma 


Moderately 
differentiated 
adenocarcinoma 
of cecum 


Stage II; 
T3N0M0


TX01 


Colon 


NAT 




NAT 




DC22 


Colon 


CAN 




Cancer 




DC22 


Colon 


NAT 




NAT 




030B 


Urinary 
Bladder 


CAN 


Carcinoma 


invasive
carcinoma, poorly
differentiated


Stage III, Grade 3


030B 


Urinary
Bladder 


NAT 




NAT 




TR17 


Urinary 
Bladder 


CAN 


Carcinoma 


transitional 
cell carcinoma 


Stage II/Grade III


TR17 


Urinary 
Bladder 


NAT 




NAT 




520B 


Urinary 
Bladder 


CAN 


Sarcomatoid
transitional 
cell carcinoma 


Sarcomatoid 
transitional 
cell carcinoma 




520B 


Urinary 
Bladder 


NAT 




NAT 




KS52 


Cervix 


CAN 


Squamous cell 
carcinoma 


Keratinizing 
Squamous Cell 
Carcinoma 


IIIB, well 
diff. G1;
T3bNxM0 


KS52 


Cervix 


NAT 




NAT 




NK23 


Cervix 


CAN 




Nonkeratinizing 
Large Cell 


FIGO IIIB, 
undiff. G4;
T3bNxM0 


NK23 


Cervix 


NAT 




NAT 




NKS54 


Cervix 


CAN 


Squamous cell 
carcinoma 


Nonkeratinizing 
Squamous Cell 
Carcinoma 


IIB, mod
diff. G2; 
T2bNxM0 


NKS54 


Cervix 


NAT 




NAT 










Cervix


CAN


Squamous cell
carcinoma


Nonkeratinizing
Squamous Cell
Carcinoma


IIIB, mod
diff. G2;
T3bNxM0 




Cervix 


NAT 




NAT 




NKS81 


Cervix 


CAN 


Squamous cell 
carcinoma 


large cell 
nonkeratinizing 
sq carc, IIB,
moderately diff 


IIB 


NKS81 


Cervix 


NAT 




NAT 




NKS25


Cervix


CAN








NKS25 


Cervix 


NAT 




NAT 




NKS18


Cervix 


CAN 


Squamous cell
carcinoma


Nonkeratinizing
squamous cell
carcinoma


GII


NKS18 


Cervix 


NAT 




NAT 




10479 


Endomet
rium


CAN 




malignant mixed 
mullerian tumor 


T?, NX, M1


10479 


Endomet 
rium 


NAT 




NAT 




28XA 


Endomet 
rium 


CAN 


Endometrial 
adenocarcinoma 


malignant


II/III


28XA 


Endomet 
rium 


NAT






II/III


8XA 


Endomet
rium


CAN 


mod. diff,
invasive,
squamous
differentiation, FIGO-II






8XA 


Endomet 
rium 


NAT 




NAT 






Kidney


CAN 


carcinoma


renal cell
carcinoma clear
cell, localized


3 


iUDAU 




NAT 




NL 




1 o*7Yn 

1U / A.U 


Kidney


CAN


Renal cell
carcinoma


renal cell 
carcinoma, clear 
cell, with 
metastatic 


G III 




Kidney


NAT 




NL 




109XD 


Kidney 


CAN 




Malignant 


G III 


109XD


Kidney


NAT




NL 




10XD




CAN 


Renal cell
carcinoma 


renal cell 
carcinoma, clear 
cell, localized, 
grade 2-3 


3 


10XD 


Kidney 


NAT 




NL 




22K




CAN


carcinoma


Renal cell
carcinoma


G2, Mod.
Diff. 


22K 


Kidney 


NAT 




NAT 




12XD 


Kidney 


CAN 


Renal cell 
carcinoma 


Left renal cell 
carcinoma 




12XD 


Kidney 


NAT 




NAT 




15XA 


Liver 


CAN 




Sarcoma, Retroperitoneal
Tumor


Grade -2 


15XA 


Liver 


NAT 




CA 


St. I, G4 


174L 


Liver 


CAN 


Hepatocellular
carcinoma 


Moderate to well 
differentiated 
hepatocellular 
carcinoma




174L


Liver 


NAT 


Hepatocellular
carcinoma


NAT 




187L 


Liver 


CAN 


Adenocarcinoma 


Metastatic 
Adenocarcinoma 


Liver
(Gallbladder)


187L 


Liver 


NAT 




NAT 




205L


Lung 


CAN 


Adenocarcinoma 


poorly 

differentiated 
adenocarcinoma 


T2, N1, Mx


205L 


Lung 


NAT 




NAT 




315L 


Lung 


CAN 


Squamous cell 
carcinoma 






315L 


Lung 


NAT 


Adenocarcinoma 


NAT 




507L 


Lung 


CAN 


Bronchioalve
olar carcinoma


bronchioalveolar
carcinoma


Stage IB, 
G1 well
diff. 


507L 


Lung 


NAT 




NAT 














528L


Lung


CAN


Adenocarcinoma


Adenocarcinoma


St. IV, T2N0M1,
infiltrating
poorly diff.


528L 


Lung 


NAT 




NAT 




8837L 


Lung 


CAN 


Squamous cell 
carcinoma 


Squamous cell 
carcinoma 


T2, N0, M0


8837L 


Lung 


NAT 




NAT 




AC11 


Lung 


CAN 


Adenocarcinoma 


poorly 

differentiated 
adenocarcinoma 


T2, N2, M1


AC11 


Lung 


NAT 




NAT 




AC39 


Lung 


CAN 


Adenocarcinoma 


intermediate 
grade 

adenocarcinoma


T2, N2, Mx 


AC39 


Lung 


NAT 




NAT 




SQ80 


Lung 


CAN 


Squamous cell
carcinoma 


poorly
differentiated
squamous cell
carcinoma


T1, N1, M0


SQ80 


Lung 


NAT 




NAT 




SQ81 


Lung 


CAN 


Squamous cell 
carcinoma 


poorly

differentiated 

squamous 

carcinoma 


T3, N1, Mx


SQ81 


Lung 


NAT 




NAT 




19DN 


Mammary 


CAN 


Invasive 

ductal 

carcinoma 


Invasive ductal 
carcinoma 


G3 , Stage 
IIA; T2N0M0 


19DN 


Mammary 


NAT 




NAT 




42DN 


Mammary 


CAN 


Invasive 

ductal 

carcinoma 


Invasive Ductal 
Carcinoma 


T3aNlM0 
IIIA, G3 


42DN 


Mammary 


NAT 




NAT 




517 


Mammary 


CAN 


Infiltrating 

ductal 

carcinoma 


Infiltrating 
ductal carcinoma 


St. IIA, G3 


517 


Mammary 


NAT 




NAT 





781M




CAN 


Invasive 
ductal 
carcinoma




Architectural
grade 3/3, Nuclear
grade 3/3


781M 


Mammary


NAT 




NAT 




869M 


Mammary


CAN 


Invasive 
carcinoma 


Invasive 
Carcinoma 


Stage IIA 
G1; T2N0M0


869M 


Mammary 


NAT 




NAT 




976M 


Mammary


CAN 


Invasive 

ductal 

carcinoma 


Invasive Ductal 
Carcinoma 


T2N1M0 
(Stage 2B 
Grade 2-3) 


976M 


Mammary


NAT 




NAT 




S570 


Mammary 


CAN 


Carcinoma 


Carcinoma 


Stage
IIA; T1N1M0


S570


Mammary


NAT 




NAT 




S699 


Mammary 


CAN 


Invasive

lobular 

carcinoma 


Invasive Lobular 
Carcinoma 


Stage IIB 
G1; T2N1M0


S699 


Mammary 


NAT 




NAT 




S997 


Mammary 


CAN 


Invasive

ductal 
carcinoma 


Invasive Ductal 
Carcinoma 


Stage IIB 
G3; T2N1M0


S997 


Mammary 


NAT 




NAT 




G021 


Ovary 


CAN 


Carcinoma 


St. IIIC, poorly 
diff.


Stage- 
IIIC, 
poorly 
diff. 


G021 


Ovary 


NAT




NAT 







Ovary 


CAN 




papillary serous 

and endometrioid 

ovarian 

carcinoma, 

concurrent 

metastatic 

breast cancer 


3 


1040O 


Ovary 


CAN 




papillary serous 
adeno,
metastatic 




1050 


Ovary 


CAN 




Papillary Serous 
Carcinoma with 
Focal Mucinous 
Differentiation 


Stage IC 
GO; T1CN0M0 


13 OX 


Ovary 


CAN 




Ovarian cancer 




7180 


Ovary 


CAN 


Adenocarcinoma 


malignant tumor 


IIIC 


A1B 


Ovary 


CAN 


Adenocarcinoma 


CA 




1230 


Ovary 


NRM 




Normal 




18GA 


Ovary 


NRM 




NL




2061 


Ovary 


NRM 




NL 




3370 


Ovary 


NRM 




Normal 




4 0G 


Ovary 


NRM 




NL 




5150 


Ovary 


NRM 




Normal 




C004 


Ovary 


NRM 




NL 




C177 


Ovary 


NRM 




several fluid 
filled cysts 




4510 


Ovary 


NRM 




Normal Tissue 




7180 


Ovary 


CAN 


Adenocarcinoma 


malignant tumor 


IIIC 


71XL 


Pancrea 
s 


CAN 




villous adenoma 
with paneth cell 
metaplasia 


localized


71XL


Pancrea 
s 


NAT 




NL 




82XP 


Pancrea 
s 


CAN 




serous
cystadenoma




82XP 


Pancrea 
s 


NAT 




NL 




92X 


Pancrea 
s 


CAN 


Ductal 

adenocarcinoma 


ductal 

adenocarcinoma 


mod to 
focally
poorly
diff. 


92X 


Pancrea 
s 


NAT 




NL 




77X 


Pancrea 
s 


CAN 


Hepatic 
adenoma 


Hepatic adenoma 




77X 


Pancrea 
s 


NAT 




NL 




23B 


Prostat 
e 


CAN 




Prostate tumor 


Gleason's
3+4 


23B 


Prostat 
e 


NAT 




NAT 




65XB 


Prostat 
e 


CAN 


Adenocarcinoma 


adenocarcinoma


3+4=7 


65XB 


Prostat 
e 


NAT 




NL 




675P 


Prostat 
e 


CAN 


Adenocarcinoma 


adenocarcinoma




675P 


Prostat 
e 


NAT 




Normal 




84XB 


Prostat 
e 


CAN 


Adenocarcinoma 


adenocarcinoma


2 + 3 


84XB 


Prostat 
e 


NAT 




NL 




958P 


Prostat 
e 


CAN 


Adenocarcinoma 


Adenocarcinoma


T2C, N0, MX


958P 


Prostat 
e 


NAT 


NAT 


Normal 




263C 


Prostat 
e 


BPH 




BPH 




276P 


Prostat 
e 


BPH 




BPH 




767B 


Prostat 
e 


BPH 




prostate BPH 




855P 


Prostat 
e 


BPH 




BPH 




10R 


Prostat 
e 


PROST 




active chronic 
prostatitis 


T0, N0, M0


20R


Prostat 
e 


PROST 




PROSTATITIS 




287S 


Skin 


CAN 


Squamous cell 
carcinoma 


Invasive 
Keratinizing 
Squamous Cell 
Carcinoma 


Moderately 

Differentia 

ted 


287S 


Skin 


NAT 




NAT 




39A 


Skin 


CAN 




CA 


St. II 


39A 


Skin 


NAT 




CA 


St. II 


669S 


Skin 


CAN 


Melanoma 


Nodular 

malignant 

melanoma 




669S


Skin 


NAT




NAT




171S


Small 

Intesti

ne 


CAN 


Adenocarcinoma 


Moderately 
differentiated 
Adenocarcinoma,
invasive 




171S 


Small 

Intesti

ne 


NAT 




NAT 




20SM 


Small 

Intesti 

ne 


CAN 


Adenocarcinoma 


Adenocarcinoma,
metastatic to lung
& liver 


St. IV, 

poorly 

diff. 




Small 

Intesti 

ne 


NAT




NAT 




H89 


Smal 1 
Inters t i 
ne 


CAN 


Adenocarcinoma 


Adenocarcimoa 


80% tumor, 
50% 

necrosis , 
moderately 
differentia 
ted, G2-3; 
T3N1MX 


H89 


Small 

Intesti 

ne 


NAT 


Adenocarcinoma 


NAT 




261S | Stomach | CAN | Signet-ring cell carcinoma | Signet-ring cell carcinoma | Stage IIIA, T3N1M0
261S | Stomach | NAT | | NAT
288S | Stomach | CAN | Adenocarcinoma | Infiltrating Adenocarcinoma | Moderately Differentiated
288S | Stomach | NAT | | NAT
AC93 or 509L | Stomach | CAN | Adenocarcinoma | Adenocarcinoma | St. IV, G4, T4N3M0, poorly diff.
AC93 or 509L | Stomach | NAT | | NAT
88S | Stomach | CAN | Adenocarcinoma | Mucinous adenocarcinoma | T3N1M0, St. IIIA
88S | Stomach | NAT | | NAT
143N | Thyroid Gland | CAN | Follicular carcinoma | Follicular Carcinoma
143N | Thyroid Gland | NAT | | NAT
270T | Thyroid Gland | CAN | | CA
270T | Thyroid Gland | NAT | | NAT
56T | Thyroid Gland | CAN | Papillary carcinoma | Papillary Carcinoma | St. III; T4N1M0
56T | Thyroid Gland | NAT | | NAT
39X | Testes | CAN | | CA
39X | Testes | NAT | | NAT
647T | Testes | CAN | Teratocarcinoma | Teratocarcinoma | Stage IA
647T | Testes | NAT | Teratocarcinoma | NAT
663T | Testes | CAN | Teratocarcinoma | Teratocarcinoma
663T | Testes | NAT | | NAT
135XO | Uterus | CAN | | Uterus normal
135XO | Uterus | NAT | | Uterus tumor
85XU | Uterus | CAN | | endometrial carcinoma | I
85XU | Uterus | NAT | | NL
B1 | Blood | NRM | | Normal
B3 | Blood | NRM | | Normal
B5 | Blood | NRM | | Normal
B6 | Blood | NRM | | Normal
B11 | Blood | NRM | | Normal
982B | Blood | NRM | | Normal
B69 | Blood | NRM | | Normal
B72 | Blood | NRM | | Normal
B73 | Blood | NRM | | Normal
B75 | Blood | NRM | | Normal
48AD | Adrenal Gland | NRM | | Normal
10BR | Brain | NRM | | Normal
01CL | Colon | NRM | | Normal
06CV | Cervix | NRM | | Normal
01ES | Esophagus | NRM | | Normal
46HR | Heart | NRM | | Normal
00HR | Human Reference | CAN | CAN | Cancer pool
55KD | Kidney | NRM | | Normal
89LV | Liver | NRM | | Normal
90LN | Lung | NRM | | Normal
01MA | Mammary | NRM | | Normal
84MU | Skeletal Muscle | NRM | | Normal
3APV | Ovary | NRM | | Normal
04PA | Pancreas | NRM | | Normal
59PL | Placenta | NRM | | Normal
09PR | Prostate | NRM | | Normal
21RC | Rectum | NRM | | Normal
59SM | Small Intestine | NRM | | Normal
7GSP | Spleen | NRM | | Normal
09ST | Stomach | NRM | | Normal
4GTS | Testes | NRM | | Normal
99TM | Thymus Gland | NRM | | Normal
16TR | Trachea | NRM | | Normal
57UT | Uterus | NRM | | Normal
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The relative expression level of Cln260 in various tissue samples is included below. 
Tissue samples include 78 pairs of matching samples, 7 non-matched cancer samples, and
34 normal samples, all from various tissues annotated in the table. A matching pair is
formed by mRNA from the cancer sample for a particular tissue and mRNA from the
normal adjacent sample for that same tissue from the same individual. Of the normal
samples, 4 were blood samples, which measured the expression levels in blood cells.
Additionally, 2 prostatitis and 4 Benign Prostatic Hyperplasia (BPH) samples are
included. All the values are compared to colon cancer sample CLNC9XR (calibrator).
The table below contains the relative expression level values for the sample as
compared to the calibrator. The table includes the Sample ID and expression level values
for the following samples: Cancer (CAN), Normal Adjacent Tissue (NAT), Normal Tissue
(NRM), Benign Prostatic Hyperplasia (BPH), and Prostatitis (PROST).



Sample ID | CAN | NAT | NRM | BPH | PROST
CLNAS12 | 0.28 | 0.60
CLNAS46 | 1.08 | 0.07
CLNB34 | 0.52 | 0.00
CLNC9XR | 1.00 | 0.36
CLNCM67 | 0.99 | 0.78
CLNTX89 | 1.14 | 1.87
CLNAS43 | 0.00 | 0.00
CLNAS98 | 0.00 | 0.00
CLNRS53 | 4.88 | 6.41
CLNRC01 | 5.96 | 1.53
CLNSG27 | 0.00 | 1.92
CLNDC19 | 6.28 | 0.00
CLN401C | 4.67 | 4.98
CLNCM12 | 1.29 | 0.00
CLNTX01 | 3.99 | 0.00
BLD030B | 0.90 | 0.00
BLD520B | 0.00 | 0.67
BLDTR17 | 0.00 | 0.79
CVXKS52 | 0.00 | 1.28
CVXNK23 | 0.73 | 0.00
CVXNKS54 | 0.00 | 0.00
CVXNKS55 | 0.00 | 0.00
CVXNKS81 | 0.00 | 0.00
ENDO10479 | 0.00 | 6.68
ENDO28XA | 0.00 | 2.17
ENDO8XA | 0.00 | 0.16
KID106XD | 0.00 | 0.00
KID107XD | 4.62 | 0.00
KID109XD | 1.15 | 0.00
KID10XD | 6.52 | 0.00
KID22K | 0.11 | 0.12
LNG205L | 0.00

0.00 = Negative or Not Detected



The sensitivity for Cln260 expression was calculated for the cancer samples versus
normal samples. The sensitivity value indicates the percentage of cancer samples that
show levels of Cln260 at least 2-fold higher than the normal tissue or the corresponding
normal adjacent tissue from the same patient.

This specificity is an indication of the level of colon tissue-specific expression of
the transcript compared to all the other tissue types tested in our assay. Thus, these
experiments indicate that Cln260 is useful as a colon cancer diagnostic marker and/or
therapeutic target.
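
The 2-fold criterion described above can be sketched in a few lines of Python. This is only an illustration, not the applicants' actual analysis: the function names are hypothetical, the pair values are the colon (CLN) CAN/NAT readings from the Cln260 table, the CLNCM12 cancer value is hard to read in the source and 1.29 is an assumed reading, and the handling of 0.00 ("Negative or Not Detected") is an assumption (a detected value against 0.00 counts as a change, 0.00 against 0.00 does not).

```python
# Colon (CLN) matched pairs from the Cln260 table: (sample ID, CAN, NAT).
# Assumption: CLNCM12 CAN is read as 1.29 (the scanned value is garbled).
CLN260_COLON_PAIRS = [
    ("CLNAS12", 0.28, 0.60), ("CLNAS46", 1.08, 0.07), ("CLNB34", 0.52, 0.00),
    ("CLNC9XR", 1.00, 0.36), ("CLNCM67", 0.99, 0.78), ("CLNTX89", 1.14, 1.87),
    ("CLNAS43", 0.00, 0.00), ("CLNAS98", 0.00, 0.00), ("CLNRS53", 4.88, 6.41),
    ("CLNRC01", 5.96, 1.53), ("CLNSG27", 0.00, 1.92), ("CLNDC19", 6.28, 0.00),
    ("CLN401C", 4.67, 4.98), ("CLNCM12", 1.29, 0.00), ("CLNTX01", 3.99, 0.00),
]


def at_least_fold(numerator, denominator, fold=2.0):
    """Return True if numerator is at least `fold` times denominator.

    Assumption: an undetected numerator (0.00) never counts as a change,
    while a detected numerator against an undetected denominator does.
    """
    if numerator <= 0.0:
        return False
    return numerator >= fold * denominator


def sensitivity_vs_nat(pairs, fold=2.0):
    """Percentage of matched pairs that are up or down versus NAT, rounded."""
    n = len(pairs)
    up = sum(at_least_fold(can, nat, fold) for _, can, nat in pairs)
    down = sum(at_least_fold(nat, can, fold) for _, can, nat in pairs)
    return round(100.0 * up / n), round(100.0 * down / n)


up_pct, down_pct = sensitivity_vs_nat(CLN260_COLON_PAIRS)
print(f"Up vs. NAT: {up_pct}%  Down vs. NAT: {down_pct}%")
```

Under these assumptions the colon pairs give 7 of 15 up and 2 of 15 down, which agrees with the CLN column reported for the NAT comparisons.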



Sensitivity and specificity data are reported in the table below.





| CLN | LNG | MAM | OVR | PRO
Sensitivity, Up vs. NAT | 47% | 22% | 22% | 0% | 0%
Sensitivity, Down vs. NAT | 13% | 11% | | | 40%
Sensitivity, Up vs. NRM | 33% | 11% | 22% | 43% | 0%
Sensitivity, Down vs. NRM | 27% | 78% | 0% | 0% | 100%
Specificity | 70.93% | 64.13% | 61.96% | 64.52% | 63.98%



Altogether, the tissue specificity, plus the mRNA differential expression in the
samples tested, are believed to make Cln260 a good marker for diagnosing, monitoring,
staging, imaging and/or treating colon cancer.

Additionally, the tissue specificity, plus the mRNA differential expression in the
samples tested, are believed to make Cln260 a good marker for diagnosing, monitoring,
staging, imaging and/or treating ovarian cancer.

Primers used for QPCR Expression Analysis of Cln260 are as follows:

(Cln260_forward): GGCCAACTCTTTTTACTCCTTCATT (SEQ ID NO:249) 
(Cln260_reverse): CAGGAGCATCTCCGTTTTCATT (SEQ ID NO:250) 
(Cln260_probe): TCCAGTAGTTGGGCAGTGCTGGCA (SEQ ID NO:251) 
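
As a quick sanity check on oligonucleotides such as those listed above, basic properties (length, GC fraction, reverse complement) can be computed directly from the sequences. The sketch below is illustrative only; the helper names are hypothetical and nothing here is part of the QPCR protocol itself.

```python
# Cln260 QPCR oligonucleotides as listed above (SEQ ID NO:249-251).
CLN260_OLIGOS = {
    "Cln260_forward": "GGCCAACTCTTTTTACTCCTTCATT",
    "Cln260_reverse": "CAGGAGCATCTCCGTTTTCATT",
    "Cln260_probe": "TCCAGTAGTTGGGCAGTGCTGGCA",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_fraction(seq):
    """Fraction of G and C bases in a DNA sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


for name, seq in CLN260_OLIGOS.items():
    print(f"{name}: length={len(seq)} nt, GC={gc_fraction(seq):.2f}, "
          f"revcomp={reverse_complement(seq)}")
```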
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The relative expression level of Cln261 in various tissue samples is included below. 
Tissue samples include 78 pairs of matching samples, 8 non-matched cancer samples, and
36 normal samples, all from various tissues annotated in the table. A matching pair is
formed by mRNA from the cancer sample for a particular tissue and mRNA from the
normal adjacent sample for that same tissue from the same individual. Of the normal
samples, 5 were blood samples, which measured the expression levels in blood cells.
Additionally, 2 prostatitis and 4 Benign Prostatic Hyperplasia (BPH) samples are
included. All the values are compared to normal breast sample Mam01MA (calibrator).

The table below contains the relative expression level values for the sample as
compared to the calibrator. The table includes the Sample ID and expression level values
for the following samples: Cancer (CAN), Normal Adjacent Tissue (NAT), Normal Tissue
(NRM), Benign Prostatic Hyperplasia (BPH), and Prostatitis (PROST).



Sample ID | CAN | NAT | NRM | BPH | PROST
CLNAS12 | 1.30 | 0.14
CLNAS46 | 0.55 | 0.69
CLNB34 | 0.89 | 0.00
CLNCM67 | 1.18 | 0.14
CLNDC22 | 0.31 | 3.01
CLNTX89 | 0.81 | 0.21
CLN401C | 0.65 | 0.38
CLNAS43 | 1.56 | 0.72
CLNAS98 | 0.14 | 0.19
CLNCM12 | 0.34 | 0.06
CLNDC19 | 0.89 | 0.40
CLNRC01 | 1.54 | 0.28
CLNRS53 | 2.47 | 1.29
CLNSG27 | 1.62 | 0.27
CLNTX01 | 0.85 | 0.85
BLD030B | 2.56 | 1.55
BLD520B | 0.48 | 1.04
BLDTR17 | 1.50 | 0.71
CVXKS52 | 1.87 | 0.34
CVXNKS55 | 1.11 | 1.04
CVXNKS25 | 2.45 | 2.61
CVXNKS18 | 0.06 | 0.32
CVXNKS54 | 2.53 | 1.33
ENDO10479 | 16.57 | 2.34
ENDO28XA | 3.15 | 1.37
ENDO8XA | 0.32 | 2.21
KID106XD | 0.09 | 0.10
KID12XD | 0.14 | 1.54




KID10XD | 0.11 | 0.04
KID22K | 0.23 | 0.14
KID107XD | 0.42 | 0.18
LNG205L | 2.84 | 0.22
LNG315L | 2.27 | 1.88
LNG507L | 1.74 | 0.99
LNG528L | 7.04 | 2.07
LNG8837L | 4.48 | 0.98
LNGAC11 | 1.78 | 1.93
LNGAC39 | 11.41 | 2.38
LNGSQ80 | 3.11 | 1.94
LNGSQ81 | 4.22 | 3.14
LVR15XA | 0.16 | 0.06
LVR174L | 0.01 | 0.05
LVR187L | 0.06 | 6.50
MAM19DN | 0.90 | 0.42
MAM42DN | 1.21 | 0.65
MAM517 | 0.99 | 0.07
MAM781M | 0.67 | 0.29
MAM869M | 0.62 | 0.13
MAM976M | 0.33 | 5.39
MAMS570 | 0.08 | 0.86
MAMS699 | 0.66 | 0.88
MAMS997 | 3.12 | 1.05
OVRG021 | 0.71 | 3.06
OVR206I | | | 1.06
OVR5150 | | | 0.52
OVR18GA | | | 0.73
OVR3370 | | | 0.65
OVR1230 | | | 0.00
OVRC177 | | | 0.57
OVR40G | | | 1.11
OVR1005O | 0.61
OVR1040O | 4.19
OVR105O | 0.53
OVR130X | 3.60
OVR4510 | | | 0.90
OVR7180 | 1.88
OVRA1B | 3.31
PAN71XL | 0.40 | 0.40
PAN77X | 0.09
PAN92X | 0.21 | 0.71
PRO10R | | | | | 3.68
PRO20R | | | | | 2.16
PRO23B | 0.07 | 0.12
PRO263C | | | | 1.40
PRO276P | | | | 0.08






PRO65XB | 0.03 | 0.07
PRO675P | 0.13 | 0.09
PRO767B | | | | 1.33
PRO84XB | 0.10 | 0.31
PRO855P | | | | 0.05
PRO958P | 0.09 | 0.06
SKN287S | 0.27 | 0.21
SKN39A | 0.24 | 0.16
SKN669S | 0.45 | 0.10
SMINT171S | 0.16 | 0.08
SMINT20SM | 0.16 | 0.11
SMINTH89 | 0.60 | 0.17
STO261S | 0.27 | 0.24
STO288S | 0.30 | 0.11
STOAC93 | 0.26 | 0.50
STO88S | 0.29 | 0.03
THRD143N | 9.86 | 4.19
THRD270T | 3.59 | 2.98
THRD56T | 0.29 | 0.20
TST39X | 12.51 | 9.60
TST647T | 17.27 | 3.65
TST663T | 13.34 | 5.10
UTR135XO | 7.21 | 5.38
UTR85XU | 19.32 | 8.35
BLOB3 | | | 0.02
BLOB11 | | | 0.53
BLO69 | | | 0.13
BLO72 | | | 0.19
BLO73 | | | 0.43
ADR48AD | | | 0.06
BRN10BR | | | 0.33
CLN01CL | | | 0.14
CVX06CV | | | 1.28
ESO01ES | | | 2.01
HRT46HR | | | 0.66
HUMREF00HR | 2.61
KID55KD | | | 0.48
LVR89LV | | | 0.05
LNG90LN | | | 14.35
MAM01MA | | | 1.00
MSL84MU | | | 1.62
OVR3APV | | | 4.73
PAN04PA | | | 0.23
PLA59PL | | | 1.23
PRO09PR | | | 1.09
REC21RC | | | 0.22




SMINT59SM | | | [illegible]
SPL7GSP | | | 1.74
STO09ST | | | 0.33
THYM99TM | | | 0.54
TRA16TR | | | 0.97
TST4GTS | | | 2.46
UTR57UT | | | 1.28



The sensitivity for Cln261 expression was calculated for the cancer samples versus
normal samples. The sensitivity value indicates the percentage of cancer samples that
show levels of Cln261 at least 2-fold higher than the normal tissue or the corresponding
normal adjacent tissue from the same patient.

This specificity is an indication of the level of colon tissue-specific expression of
the transcript compared to all the other tissue types tested in our assay. Thus, these
experiments indicate that Cln261 is useful as a colon cancer diagnostic marker and/or
therapeutic target.



Altogether, the tissue specificity, plus the mRNA differential expression in the
samples tested, are believed to make Cln261 a good marker for diagnosing, monitoring,
staging, imaging and treating colon cancer.

Primers used for QPCR Expression Analysis of Cln261 are as follows: 
(Cln261_forward): TCGGCAGACATGTGCATTG (SEQ ID NO:252) 
(Cln261_reverse): CGTTGCTTGTACGCTCCGTAA (SEQ ID NO:253) 
(Cln261_probe): CATTGCGATTTCTCTTCTCATGATCCTGATATG (SEQ ID NO:254) 

Conclusions
Altogether, the high level of tissue specificity, plus the mRNA overexpression in
matched samples tested are indicative of SEQ ID NO: 1-94 being a diagnostic marker
and/or a therapeutic target for cancer.

Example 3: Protein Expression 
The CSNA is amplified by polymerase chain reaction (PCR) and the amplified
DNA fragment encoding the CSNA is subcloned in pET-21d for expression in E. coli. In
addition to the CSNA coding sequence, codons for two amino acids, Met-Ala, flanking the
NH2-terminus of the coding sequence of CSNA, and six histidines, flanking the
COOH-terminus of the coding sequence of CSNA, are incorporated to serve as initiating
Met/restriction site and purification tag, respectively.
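
The construct layout described above (Met-Ala, then the CSNA coding sequence, then six histidines) can be sketched as follows. This is a minimal illustration under stated assumptions: the codon choices (GCC for Ala, CAC for His, TAA as the stop), the function names, and the nine-base example coding sequence are all hypothetical and are not taken from the specification.

```python
from itertools import product

# Standard genetic code, built in TCAG order (first, second, third base).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def build_tagged_orf(csna_cds, n_his=6):
    """Return Met-Ala + CSNA coding sequence + His tag + stop codon.

    `csna_cds` is assumed to be in frame, without its own start or stop codon.
    Codons ATG/GCC/CAC/TAA are illustrative choices, not from the source.
    """
    csna_cds = csna_cds.upper()
    if len(csna_cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "ATG" + "GCC" + csna_cds + "CAC" * n_his + "TAA"


def translate(orf):
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    return "".join(CODON_TABLE[orf[i:i + 3]] for i in range(0, len(orf), 3))


example_cds = "CTGAAAGGT"  # hypothetical stand-in for a CSNA coding sequence
tagged = build_tagged_orf(example_cds)
print(tagged)
print(translate(tagged))
```

Translating the assembled sequence is a cheap way to confirm that the tag stays in frame with the coding sequence before primers are ordered.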

An over-expressed protein band of the appropriate molecular weight may be
observed on a Coomassie blue stained polyacrylamide gel. This protein band is confirmed
by Western blot analysis using a monoclonal antibody against the 6X Histidine tag.

Large-scale purification of CSP is achieved using cell paste generated from 6-liter
bacterial cultures, and purified using immobilized metal affinity chromatography (IMAC).
Soluble fractions that are separated from total cell lysate are incubated with a nickel
chelating resin. The column is packed and washed with five column volumes of wash
buffer. CSP is eluted stepwise with imidazole buffers of various concentrations.

Example 4: Fusion Proteins 

The human Fc portion of the IgG molecule can be PCR amplified, using primers
that span the 5' and 3' ends of the sequence described below. These primers also should
have convenient restriction enzyme sites that will facilitate cloning into an expression
vector, preferably a mammalian expression vector. For example, if pC4 (Accession No.
209646) is used, the human Fc portion can be ligated into the BamHI cloning site. Note
that the 3' BamHI site should be destroyed. Next, the vector containing the human Fc
portion is re-restricted with BamHI, linearizing the vector, and a polynucleotide of the
present invention, isolated by the PCR protocol described in Example 2, is ligated into this
BamHI site. Note that the polynucleotide is cloned without a stop codon, otherwise a
fusion protein will not be produced. If the naturally occurring signal sequence is used to
produce the secreted protein, pC4 does not need a second signal peptide. Alternatively, if
the naturally occurring signal sequence is not used, the vector can be modified to include a
heterologous signal sequence. See, e.g., WO 96/34891.
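
The requirement that the insert carry no stop codon can be checked before cloning. The following sketch assumes the insert is supplied as an in-frame open reading frame; the function name and the example sequences are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def orf_for_c_terminal_fusion(orf):
    """Return the ORF with a terminal stop codon removed, ready to be fused
    upstream of another coding region (for example, an Fc portion).

    Raises ValueError if the ORF is out of frame or has an internal stop,
    since either would prevent read-through into the fusion partner.
    """
    orf = orf.upper()
    if len(orf) % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    codons = [orf[i:i + 3] for i in range(0, len(orf), 3)]
    if codons and codons[-1] in STOP_CODONS:
        codons.pop()  # drop the natural stop codon
    if any(codon in STOP_CODONS for codon in codons):
        raise ValueError("internal stop codon found")
    return "".join(codons)


print(orf_for_c_terminal_fusion("ATGAAATAA"))  # hypothetical three-codon ORF
```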
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Example 5: Production of an Antibody from a Polypeptide 

In general, such procedures involve immunizing an animal (preferably a mouse)
with polypeptide or, more preferably, with a secreted polypeptide-expressing cell. Such
cells may be cultured in any suitable tissue culture medium; however, it is preferable to
culture cells in Earle's modified Eagle's medium supplemented with 10% fetal bovine
serum (inactivated at about 56°C), and supplemented with about 10 g/l of nonessential
amino acids, about 1,000 U/ml of penicillin, and about 100 µg/ml of streptomycin. The
splenocytes of such mice are extracted and fused with a suitable myeloma cell line. Any
suitable myeloma cell line may be employed in accordance with the present invention;
however, it is preferable to employ the parent myeloma cell line (SP20), available from
the ATCC. After fusion, the resulting hybridoma cells are selectively maintained in HAT
medium, and then cloned by limiting dilution as described by Wands et al.,
Gastroenterology 80: 225-232 (1981).

The hybridoma cells obtained through such a selection are then assayed to identify
clones which secrete antibodies capable of binding the polypeptide. Alternatively,
additional antibodies capable of binding to the polypeptide can be produced in a two-step
procedure using anti-idiotypic antibodies. Such a method makes use of the fact that
antibodies are themselves antigens, and therefore, it is possible to obtain an antibody
which binds to a second antibody. In accordance with this method, protein-specific
antibodies are used to immunize an animal, preferably a mouse. The splenocytes of such
an animal are then used to produce hybridoma cells, and the hybridoma cells are screened
to identify clones which produce an antibody whose ability to bind to the protein-specific
antibody can be blocked by the polypeptide. Such antibodies comprise anti-idiotypic
antibodies to the protein-specific antibody and can be used to immunize an animal to
induce formation of further protein-specific antibodies.

Example 6: Method of Determining Alterations in a Gene Corresponding to a 
Polynucleotide 

RNA is isolated from individual patients or from a family of individuals that have
a phenotype of interest. cDNA is then generated from these RNA samples using protocols
known in the art. See, Sambrook (2001), supra. The cDNA is then used as a template for
PCR, employing primers surrounding regions of interest in SEQ ID NO: 1-94. Suggested
PCR conditions consist of 35 cycles at 95°C for 30 seconds; 60-120 seconds at 52-58°C;
and 60-120 seconds at 70°C, using buffer solutions described in Sidransky et al., Science
252(5006): 706-9 (1991). See also Sidransky et al., Science 278(5340): 1054-9 (1997).
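
To get a feel for the run time implied by the suggested cycling conditions, the program can be written down as data and the hold times summed. This is a rough sketch only: the step labels (denaturation, annealing, extension) and the data layout are assumptions, and ramp times, any initial denaturation, and any final extension are ignored because the text does not specify them.

```python
CYCLES = 35

# (label, temperature range in deg C, hold-time range in seconds)
# Labels are assumed; the text gives only temperatures and times.
STEPS = [
    ("denaturation", (95, 95), (30, 30)),
    ("annealing", (52, 58), (60, 120)),
    ("extension", (70, 70), (60, 120)),
]


def total_hold_seconds(cycles=CYCLES, steps=STEPS):
    """Return (shortest, longest) total hold time in seconds over all cycles."""
    shortest = cycles * sum(hold[0] for _, _, hold in steps)
    longest = cycles * sum(hold[1] for _, _, hold in steps)
    return shortest, longest


low, high = total_hold_seconds()
print(f"Total hold time: {low / 60:.1f} to {high / 60:.1f} minutes")
```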

PCR products are then sequenced using primers labeled at their 5' end with T4
polynucleotide kinase, employing SequiTherm Polymerase (Epicentre Technologies). The
intron-exon borders of selected exons are also determined and genomic PCR products
analyzed to confirm the results. PCR products harboring suspected mutations are then
cloned and sequenced to validate the results of the direct sequencing. PCR products are
cloned into T-tailed vectors as described in Holton et al., Nucleic Acids Res., 19: 1156
(1991) and sequenced with T7 polymerase (United States Biochemical). Affected
individuals are identified by mutations not present in unaffected individuals.

Genomic rearrangements may also be determined. Genomic clones are
nick-translated with digoxigenin deoxyuridine 5' triphosphate (Boehringer Mannheim), and
FISH is performed as described in Johnson et al., Methods Cell Biol. 35: 73-99 (1991).
Hybridization with the labeled probe is carried out using a vast excess of human cot-1
DNA for specific hybridization to the corresponding genomic locus.

Chromosomes are counterstained with 4',6-diamidino-2-phenylindole and propidium
iodide, producing a combination of C- and R-bands. Aligned images for precise mapping
are obtained using a triple-band filter set (Chroma Technology, Brattleboro, VT) in
combination with a cooled charge-coupled device camera (Photometrics, Tucson, AZ) and
variable excitation wavelength filters. Johnson (1991). Image collection, analysis and
chromosomal fractional length measurements are performed using the ISee Graphical
Program System (Inovision Corporation, Durham, NC). Chromosome alterations of the
genomic region hybridized by the probe are identified as insertions, deletions, and
translocations. These alterations are used as a diagnostic marker for an associated disease.

Example 7: Method of Detecting Abnormal Levels of a Polypeptide in a Biological
Sample

Antibody-sandwich ELISAs are used to detect polypeptides in a sample, preferably
a biological sample. Wells of a microtiter plate are coated with specific antibodies, at a
final concentration of 0.2 to 10 µg/ml. The antibodies are either monoclonal or polyclonal
and are produced by the method described above. The wells are blocked so that
non-specific binding of the polypeptide to the well is reduced. The coated wells are then
incubated for more than 2 hours at room temperature with a sample containing the
polypeptide. Preferably, serial dilutions of the sample should be used to validate results.
The plates are then washed three times with deionized or distilled water to remove
unbound polypeptide. Next, 50 µl of specific antibody-alkaline phosphatase conjugate, at
a concentration of 25-400 ng, is added and incubated for 2 hours at room temperature.
The plates are again washed three times with deionized or distilled water to remove
unbound conjugate. 75 µl of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl
phosphate (NPP) substrate solution is added to each well and incubated for 1 hour at room
temperature.

The reaction is measured by a microtiter plate reader. A standard curve is
prepared, using serial dilutions of a control sample, and polypeptide concentrations are
plotted on the X-axis (log scale) and fluorescence or absorbance on the Y-axis (linear
scale). The concentration of the polypeptide in the sample is calculated using the standard
curve.
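
One simple way to read a concentration off such a standard curve is piecewise-linear interpolation of the signal against log10(concentration). The sketch below assumes that approach; the text does not say which curve-fitting method is used, and the function name and the standard values in the example are hypothetical. It also assumes the signal rises monotonically with concentration.

```python
import math
from bisect import bisect_left


def concentration_from_signal(signal, standards):
    """Interpolate a concentration from a standard curve.

    `standards` is a list of (concentration, signal) pairs from serial
    dilutions of a control sample. Interpolation is linear in signal versus
    log10(concentration), matching a log-scale X-axis and linear Y-axis.
    """
    points = sorted((s, math.log10(c)) for c, s in standards)
    signals = [s for s, _ in points]
    if not signals[0] <= signal <= signals[-1]:
        raise ValueError("signal lies outside the standard curve")
    i = bisect_left(signals, signal)
    if signals[i] == signal:
        return 10 ** points[i][1]
    (s0, log_c0), (s1, log_c1) = points[i - 1], points[i]
    fraction = (signal - s0) / (s1 - s0)
    return 10 ** (log_c0 + fraction * (log_c1 - log_c0))


# Hypothetical standards: (concentration, plate-reader signal)
standards = [(1.0, 0.1), (10.0, 0.5), (100.0, 0.9)]
print(concentration_from_signal(0.3, standards))
```

Samples whose signal falls outside the range of the standards raise an error here rather than being extrapolated, which is consistent with the advice to run serial dilutions of the sample.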

Example 8: Formulating a Polypeptide 

The secreted polypeptide composition will be formulated and dosed in a fashion
consistent with good medical practice, taking into account the clinical condition of the
individual patient (especially the side effects of treatment with the secreted polypeptide
alone), the site of delivery, the method of administration, the scheduling of administration,
and other factors known to practitioners. The "effective amount" for purposes herein is
thus determined by such considerations.

As a general proposition, the total pharmaceutically effective amount of secreted
polypeptide administered parenterally per dose will be in the range of about 1 µg/kg/day
to 10 mg/kg/day of patient body weight, although, as noted above, this will be subject to
therapeutic discretion. More preferably, this dose is at least 0.01 mg/kg/day, and most
preferably for humans between about 0.01 and 1 mg/kg/day for the hormone. If given
continuously, the secreted polypeptide is typically administered at a dose rate of about 1
µg/kg/hour to about 50 mg/kg/hour, either by 1-4 injections per day or by continuous
subcutaneous infusions, for example, using a mini-pump. An intravenous bag solution
may also be employed. The length of treatment needed to observe changes and the
interval following treatment for responses to occur appear to vary depending on the
desired effect.
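
The per-kilogram ranges above translate into absolute daily amounts by simple multiplication with body weight. The sketch below only illustrates that arithmetic; the function name and the 70 kg example weight are assumptions, and nothing here is dosing guidance.

```python
def daily_amount_range_mg(weight_kg, low_mg_per_kg=0.001, high_mg_per_kg=10.0):
    """Return (low, high) total daily amount in mg for a given body weight.

    Defaults correspond to the general range of about 1 ug/kg/day
    (0.001 mg/kg/day) to 10 mg/kg/day stated in the text.
    """
    if weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return weight_kg * low_mg_per_kg, weight_kg * high_mg_per_kg


# Hypothetical 70 kg patient: general range, then the preferred human range.
print(daily_amount_range_mg(70))
print(daily_amount_range_mg(70, low_mg_per_kg=0.01, high_mg_per_kg=1.0))
```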

Pharmaceutical compositions containing the secreted protein of the invention are
administered orally, rectally, parenterally, intracisternally, intravaginally, intraperitoneally,
topically (as by powders, ointments, gels, drops or transdermal patch), buccally, or as an
oral or nasal spray. "Pharmaceutically acceptable carrier" refers to a non-toxic solid,
semisolid or liquid filler, diluent, encapsulating material or formulation auxiliary of any
type. The term "parenteral" as used herein refers to modes of administration which
include intravenous, intramuscular, intraperitoneal, intrasternal, subcutaneous and
intraarticular injection and infusion.

The secreted polypeptide is also suitably administered by sustained-release
systems. Suitable examples of sustained-release compositions include semipermeable
polymer matrices in the form of shaped articles, e.g., films, or microcapsules.
Sustained-release matrices include polylactides (U.S. Pat. No. 3,773,919, EP 58,481, the
contents of which are hereby incorporated by reference herein in their entirety), copolymers
of L-glutamic acid and gamma-ethyl-L-glutamate (Sidman, U. et al., Biopolymers 22: 547-556
(1983)), poly (2-hydroxyethyl methacrylate) (R. Langer et al., J. Biomed. Mater. Res. 15:
167-277 (1981), and R. Langer, Chem. Tech. 12: 98-105 (1982)), ethylene vinyl acetate
(R. Langer et al.) or poly-D-(-)-3-hydroxybutyric acid (EP 133,988). Sustained-release
compositions also include liposomally entrapped polypeptides. Liposomes containing the
secreted polypeptide are prepared by methods known per se: DE Epstein et al., Proc. Natl.
Acad. Sci. USA 82: 3688-3692 (1985); Hwang et al., Proc. Natl. Acad. Sci. USA 77:
4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 142,641; Japanese
Pat. Appl. 83-118008; U.S. Pat. Nos. 4,485,045 and 4,544,545; and EP 102,324, the
contents of which are hereby incorporated by reference herein in their entirety. Ordinarily,
the liposomes are of the small (about 200-800 Angstroms) unilamellar type in which the
lipid content is greater than about 30 mol. percent cholesterol, the selected proportion
being adjusted for the optimal secreted polypeptide therapy.

For parenteral administration, in one embodiment, the secreted polypeptide is 
formulated generally by mixing it at the desired degree of purity, in a unit dosage 
injectable form (solution, suspension, or emulsion), with a pharmaceutically acceptable 
carrier, i.e., one that is non-toxic to recipients at the dosages and concentrations employed 
and is compatible with other ingredients of the formulation. 

For example, the formulation preferably does not include oxidizing agents and
other compounds that are known to be deleterious to polypeptides. Generally, the
formulations are prepared by contacting the polypeptide uniformly and intimately with
liquid carriers or finely divided solid carriers or both. Then, if necessary, the product is
shaped into the desired formulation. Preferably, the carrier is a parenteral carrier, more
preferably, a solution that is isotonic with the blood of the recipient. Examples of such
carrier vehicles include water, saline, Ringer's solution, and dextrose solution.
Non-aqueous vehicles such as fixed oils and ethyl oleate are also useful herein, as well as
liposomes.

The carrier suitably contains minor amounts of additives such as substances that
enhance isotonicity and chemical stability. Such materials are non-toxic to recipients at the
dosages and concentrations employed, and include buffers such as phosphate, citrate,
succinate, acetic acid, and other organic acids or their salts; antioxidants such as ascorbic
acid; low molecular weight (less than about ten residues) polypeptides, e.g., polyarginine
or tripeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic
polymers such as polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid,
aspartic acid, or arginine; monosaccharides, disaccharides, and other carbohydrates
including cellulose or its derivatives, glucose, mannose, or dextrins; chelating agents such
as EDTA; sugar alcohols such as mannitol or sorbitol; counterions such as sodium; and/or
nonionic surfactants such as polysorbates, poloxamers, or PEG.

The secreted polypeptide is typically formulated in such vehicles at a concentration
of about 0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of about 3 to 8. It will
be understood that the use of certain of the foregoing excipients, carriers, or stabilizers
will result in the formation of polypeptide salts.

Any polypeptide to be used for therapeutic administration can be sterile. Sterility
is readily accomplished by filtration through sterile filtration membranes (e.g., 0.2 micron
membranes). Therapeutic polypeptide compositions generally are placed into a container
having a sterile access port, for example, an intravenous solution bag or vial having a
stopper pierceable by a hypodermic injection needle.

Polypeptides ordinarily will be stored in unit or multi-dose containers, for
example, sealed ampules or vials, as an aqueous solution or as a lyophilized formulation
for reconstitution. As an example of a lyophilized formulation, 10-ml vials are filled with
5 ml of sterile-filtered 1% (w/v) aqueous polypeptide solution, and the resulting mixture
is lyophilized. The infusion solution is prepared by reconstituting the lyophilized
polypeptide using bacteriostatic Water-for-Injection.
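
The amount of polypeptide in each vial of the lyophilized example follows from the definition of percent weight per volume (1% w/v is 1 g per 100 ml, i.e. 10 mg/ml). The short sketch below works through that arithmetic; the function names and the 10 ml reconstitution volume used in the example are hypothetical.

```python
def mg_per_vial(fill_volume_ml, percent_w_v):
    """Mass of solute per vial in mg; 1% (w/v) equals 10 mg per ml."""
    return fill_volume_ml * percent_w_v * 10.0


def reconstituted_mg_per_ml(mass_mg, reconstitution_volume_ml):
    """Concentration after the lyophilized cake is redissolved."""
    return mass_mg / reconstitution_volume_ml


mass = mg_per_vial(fill_volume_ml=5.0, percent_w_v=1.0)
print(f"{mass} mg of polypeptide per vial")
# Assumed reconstitution volume of 10 ml, for illustration only.
print(f"{reconstituted_mg_per_ml(mass, 10.0)} mg/ml after reconstitution")
```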

The invention also provides a pharmaceutical pack or kit comprising one or more
containers filled with one or more of the ingredients of the pharmaceutical compositions
of the invention. Associated with such container(s) can be a notice in the form prescribed
by a governmental agency regulating the manufacture, use or sale of pharmaceuticals or
biological products, which notice reflects approval by the agency of manufacture, use or
sale for human administration. In addition, the polypeptides of the present invention may
be employed in conjunction with other therapeutic compounds.

Example 9: Method of Treating Decreased Levels of the Polypeptide 

It will be appreciated that conditions caused by a decrease in the standard or
normal expression level of a secreted protein in an individual can be treated by
administering the polypeptide of the present invention, preferably in the secreted form.
Thus, the invention also provides a method of treatment of an individual in need of an
increased level of the polypeptide comprising administering to such an individual a
pharmaceutical composition comprising an amount of the polypeptide to increase the
activity level of the polypeptide in such an individual.

For example, a patient with decreased levels of a polypeptide receives a daily dose
of 0.1-100 µg/kg of the polypeptide for six consecutive days. Preferably, the polypeptide is
in the secreted form. The exact details of the dosing scheme, based on administration and
formulation, are provided above.

Example 10: Method of Treating Increased Levels of the Polypeptide 

Antisense or RNAi technology are used to inhibit production of a polypeptide of 
20 the present invention. This technology is one example of a method of decreasing levels of 
a polypeptide, preferably a secreted form, due to a variety of etiologies, such as cancer. 

For example, a patient diagnosed with abnormally increased levels of a
polypeptide is administered antisense polynucleotides intravenously at 0.5, 1.0, 1.5, 2.0
and 3.0 mg/kg/day for 21 days. This treatment is repeated after a 7-day rest period if the
treatment was well tolerated. The formulation of the antisense polynucleotide is provided
above.
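
The schedule described above (21 treatment days, then a 7-day rest before any repeat) can be laid out day by day as in the sketch below. It is an illustration under stated assumptions: the function name, the body weight, and the choice to model the rest period as zero-dose days are hypothetical, and the decision to repeat a cycle depends on tolerability, which the code does not model.

```python
# Dose levels in mg/kg/day as listed in Example 10.
DOSE_LEVELS_MG_PER_KG_DAY = (0.5, 1.0, 1.5, 2.0, 3.0)


def antisense_schedule(dose_mg_per_kg_day, body_weight_kg, cycles=1,
                       treat_days=21, rest_days=7):
    """Return a list with the daily dose in mg for each day of the plan.

    Rest days are represented as 0.0. A rest period is inserted only
    between cycles, not after the final cycle.
    """
    if dose_mg_per_kg_day not in DOSE_LEVELS_MG_PER_KG_DAY:
        raise ValueError("dose level is not one of the listed levels")
    if body_weight_kg <= 0 or cycles < 1:
        raise ValueError("body weight must be positive and cycles >= 1")
    daily_mg = dose_mg_per_kg_day * body_weight_kg
    schedule = []
    for cycle in range(cycles):
        schedule.extend([daily_mg] * treat_days)
        if cycle < cycles - 1:
            schedule.extend([0.0] * rest_days)
    return schedule


if __name__ == "__main__":
    # Hypothetical 60 kg patient at 1.0 mg/kg/day, two cycles.
    plan = antisense_schedule(1.0, 60, cycles=2)
    print(len(plan), "days;", sum(1 for d in plan if d > 0), "treatment days")
```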

Example 11: Method of Treatment Using Gene Therapy 

One method of gene therapy transplants fibroblasts, which are capable of
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a
subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and
separated into small pieces. Small chunks of the tissue are placed on a wet surface of a
tissue culture flask; approximately ten pieces are placed in each flask. The flask is turned
upside down, closed tight and left at room temperature overnight. After 24 hours at room
temperature, the flask is inverted, the chunks of tissue remain fixed to the bottom of
the flask, and fresh media (e.g., Ham's F12 media, with 10% FBS, penicillin and
streptomycin) is added. The flasks are then incubated at 37°C for approximately one week.
At this time, fresh media is added and subsequently changed every several days.
After an additional two weeks in culture, a monolayer of fibroblasts emerges. The
monolayer is trypsinized and scaled into larger flasks. pMV-7 (Kirschmeier, P. T. et al.,
DNA, 7: 219-25 (1988)), flanked by the long terminal repeats of the Moloney murine
sarcoma virus, is digested with EcoRI and HindIII and subsequently treated with calf
intestinal phosphatase. The linear vector is fractionated on agarose gel and purified using
glass beads.

The cDNA encoding a polypeptide of the present invention can be amplified using
PCR primers which correspond to the 5' and 3' end sequences respectively as set forth in
Example 3. Preferably, the 5' primer contains an EcoRI site and the 3' primer includes a
HindIII site. Equal quantities of the Moloney murine sarcoma virus linear backbone and
the amplified EcoRI and HindIII fragment are added together, in the presence of T4 DNA
ligase. The resulting mixture is maintained under conditions appropriate for ligation of the
two fragments. The ligation mixture is then used to transform bacteria HB101, which are
then plated onto agar containing kanamycin for the purpose of confirming that the vector
has the gene of interest properly inserted.

The amphotropic pA317 or GP+am12 packaging cells are grown in tissue culture to
confluent density in Dulbecco's Modified Eagle's Medium (DMEM) with 10% calf serum
(CS), penicillin and streptomycin. The MSV vector containing the gene is then added to
the media and the packaging cells are transduced with the vector. The packaging cells now
produce infectious viral particles containing the gene (the packaging cells are now referred
to as producer cells).

Fresh media is added to the transduced producer cells, and subsequently, the media
is harvested from a 10 cm plate of confluent producer cells. The spent media, containing
the infectious viral particles, is filtered through a Millipore filter to remove detached
producer cells and this media is then used to infect fibroblast cells. Media is removed
from a sub-confluent plate of fibroblasts and quickly replaced with the media from the
producer cells. This media is removed and replaced with fresh media.
If the titer of virus is high, then virtually all fibroblasts will be infected and no
selection is required. If the titer is very low, then it is necessary to use a retroviral vector
that has a selectable marker, such as neo or his. Once the fibroblasts have been efficiently
infected, the fibroblasts are analyzed to determine whether protein is produced.
The engineered fibroblasts are then transplanted onto the host, either alone or after
having been grown to confluence on Cytodex 3 microcarrier beads.

Example 12: Method of Treatment Using Gene Therapy-In Vivo 

Another aspect of the present invention is using in vivo gene therapy methods to
treat disorders, diseases and conditions. The gene therapy method relates to the
introduction of naked nucleic acid (DNA, RNA, and antisense DNA or RNA) sequences
into an animal to increase or decrease the expression of the polypeptide.

The polynucleotide of the present invention may be operatively linked to a
promoter or any other genetic elements necessary for the expression of the polypeptide by
the target tissue. Such gene therapy and delivery techniques and methods are known in
the art; see, for example, Tabata H. et al., Cardiovasc. Res. 35 (3): 470-479 (1997); Chao J.
et al., Pharmacol. Res. 35 (6): 517-522 (1997); Wolff J. A., Neuromuscul. Disord. 7 (5):
314-318 (1997); Schwartz B. et al., Gene Ther. 3 (5): 405-411 (1996); and Tsurumi Y. et
al., Circulation 94 (12): 3281-3290 (1996); WO 90/11092, WO 98/11779; U.S. Patent Nos.
5,693,622; 5,705,151; 5,580,859, the contents of which are hereby incorporated by
reference herein in their entirety.

The polynucleotide constructs may be delivered by any method that delivers
injectable materials to the cells of an animal, such as injection into the interstitial space of
tissues (heart, muscle, skin, colon, liver, intestine and the like). The polynucleotide
constructs can be delivered in a pharmaceutically acceptable liquid or aqueous carrier.

The term "naked" polynucleotide, DNA or RNA, refers to sequences that are free
from any delivery vehicle that acts to assist, promote, or facilitate entry into the cell,
including viral sequences, viral particles, liposome formulations, lipofectin or precipitating
agents and the like. However, the polynucleotides of the present invention may also be
delivered in liposome formulations (such as those taught in Felgner P. L. et al., Ann. NY
Acad. Sci. 772: 126-139 (1995) and Abdallah B. et al., Biol. Cell 85 (1): 1-7 (1995)) which
can be prepared by methods well known to those skilled in the art.
The polynucleotide vector constructs used in the gene therapy method are
preferably constructs that will neither integrate into the host genome nor contain
sequences that allow for replication. Any strong promoter known to those skilled in the art
can be used for driving the expression of DNA. Unlike other gene therapy techniques,
one major advantage of introducing naked nucleic acid sequences into target cells is the
transitory nature of the polynucleotide synthesis in the cells. Studies have shown that
non-replicating DNA sequences can be introduced into cells to provide production of the
desired polypeptide for periods of up to six months.

The polynucleotide construct can be delivered to the interstitial space of tissues
within an animal, including muscle, skin, brain, colon, liver, spleen, bone marrow,
thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall bladder, stomach,
intestine, testis, ovary, uterus, rectum, nervous system, eye, gland, and connective tissue.
Interstitial space of the tissues comprises the intercellular fluid, mucopolysaccharide
matrix among the reticular fibers of organ tissues, elastic fibers in the walls of vessels or
chambers, collagen fibers of fibrous tissues, or that same matrix within connective tissue
ensheathing muscle cells or in the lacunae of bone. It is similarly the space occupied by
the plasma of the circulation and the lymph fluid of the lymphatic channels. Delivery to
the interstitial space of muscle tissue is preferred for the reasons discussed below. The
constructs may be conveniently delivered by injection into the tissues comprising these
cells. They are preferably delivered to and expressed in persistent, non-dividing cells which
are differentiated, although delivery and expression may be achieved in non-differentiated or
less completely differentiated cells, such as, for example, stem cells of blood or skin
fibroblasts. In vivo muscle cells are particularly competent in their ability to take up and
express polynucleotides.

For the naked polynucleotide injection, an effective dosage amount of DNA or
RNA will be in the range of from about 0.05 µg/kg body weight to about 50 mg/kg body
weight. Preferably the dosage will be from about 0.005 mg/kg to about 20 mg/kg and
more preferably from about 0.05 mg/kg to about 5 mg/kg. Of course, as the artisan of
ordinary skill will appreciate, this dosage will vary according to the tissue site of injection.
The appropriate and effective dosage of nucleic acid sequence can readily be determined
by those of ordinary skill in the art and may depend on the condition being treated and the
route of administration. The preferred route of administration is by the parenteral route of
injection into the interstitial space of tissues. However, other parenteral routes may also
be used, such as inhalation of an aerosol formulation, particularly for delivery to colons or
bronchial tissues, throat or mucous membranes of the nose. In addition, naked
polynucleotide constructs can be delivered to arteries during angioplasty by the catheter
used in the procedure.
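
The nested dosage ranges given above for naked polynucleotide injection mix units, so a small unit-normalizing sketch may help when comparing them. It assumes the lower bound of the broad range reads 0.05 µg/kg (that is, 0.00005 mg/kg); the labels and function names are hypothetical and serve illustration only.

```python
# Ranges in mg/kg, ordered from narrowest to broadest.
# Assumption: the broad lower bound is 0.05 ug/kg = 0.00005 mg/kg.
RANGES_MG_PER_KG = [
    ("more preferred", 0.05, 5.0),
    ("preferred", 0.005, 20.0),
    ("effective", 0.00005, 50.0),
]


def ug_to_mg(value_ug):
    """Convert micrograms to milligrams."""
    return value_ug / 1000.0


def classify_dose(dose_mg_per_kg):
    """Return the narrowest stated range that contains the dose."""
    for label, low, high in RANGES_MG_PER_KG:
        if low <= dose_mg_per_kg <= high:
            return label
    return "outside stated range"


if __name__ == "__main__":
    for dose in (1.0, 10.0, 0.001, 100.0):
        print(dose, "mg/kg ->", classify_dose(dose))
```

Because the ranges are nested, checking the narrowest first returns the most specific label; as the text notes, the appropriate dose still varies with the tissue site, the condition treated and the route of administration.
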
The dose response effects of injected polynucleotide in muscle in vivo are
determined as follows. Suitable template DNA for production of mRNA coding for
polypeptide of the present invention is prepared in accordance with a standard
recombinant DNA methodology. The template DNA, which may be either circular or
linear, is either used as naked DNA or complexed with liposomes. The quadriceps
muscles of mice are then injected with various amounts of the template DNA.

Five to six week old female and male Balb/C mice are anesthetized by
intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1.5 cm incision is made on the
anterior thigh, and the quadriceps muscle is directly visualized. The template DNA is
injected in 0.1 ml of carrier in a 1 cc syringe through a 27 gauge needle over one minute,
approximately 0.5 cm from the distal insertion site of the muscle into the knee and about
0.2 cm deep. A suture is placed over the injection site for future localization, and the skin
is closed with stainless steel clips.

After an appropriate incubation time (e.g., 7 days) muscle extracts are prepared by
excising the entire quadriceps. Every fifth 15 µm cross-section of the individual
quadriceps muscles is histochemically stained for protein expression. A time course for
protein expression may be done in a similar fashion except that quadriceps from different
mice are harvested at different times. Persistence of DNA in muscle following injection
may be determined by Southern blot analysis after preparing total cellular DNA and HIRT
supernatants from injected and control mice.

The results of the above experimentation in mice can be used to extrapolate proper
dosages and other treatment parameters in humans and other animals using naked DNA.

Example 13: Transgenic Animals 

The polypeptides of the invention can also be expressed in transgenic animals.
Animals of any species, including, but not limited to, mice, rats, rabbits, hamsters, guinea
pigs, pigs, micro-pigs, goats, sheep, cows and non-human primates, e.g., baboons,
monkeys, and chimpanzees may be used to generate transgenic animals. In a specific
embodiment, techniques described herein or otherwise known in the art are used to
express polypeptides of the invention in humans, as part of a gene therapy protocol.

Any technique known in the art may be used to introduce the transgene (i.e.,
polynucleotides of the invention) into animals to produce the founder lines of transgenic
animals. Such techniques include, but are not limited to, pronuclear microinjection
(Paterson et al., Appl. Microbiol. Biotechnol. 40: 691-698 (1994); Carver et al.,
Biotechnology 11: 1263-1270 (1993); Wright et al., Biotechnology 9: 830-834 (1991); and
U.S. Pat. No. 4,873,191, the contents of which are hereby incorporated by reference herein
in their entirety); retrovirus mediated gene transfer into germ lines (Van der Putten et al.,
Proc. Natl. Acad. Sci. USA 82: 6148-6152 (1985)), blastocysts or embryos; gene targeting
in embryonic stem cells (Thompson et al., Cell 56: 313-321 (1989)); electroporation of
cells or embryos (Lo, Mol. Cell. Biol. 3: 1803-1814 (1983)); introduction of the
polynucleotides of the invention using a gene gun (see, e.g., Ulmer et al., Science 259:
1745 (1993)); introducing nucleic acid constructs into embryonic pluripotent stem cells
and transferring the stem cells back into the blastocyst; and sperm mediated gene transfer
(Lavitrano et al., Cell 57: 717-723 (1989)). For a review of such techniques, see Gordon,
"Transgenic Animals," Intl. Rev. Cytol. 115: 171-229 (1989).

Any technique known in the art may be used to produce transgenic clones
containing polynucleotides of the invention, for example, nuclear transfer into enucleated
oocytes of nuclei from cultured embryonic, fetal, or adult cells induced to quiescence
(Campbell et al., Nature 380: 64-66 (1996); Wilmut et al., Nature 385: 810-813 (1997)).

The present invention provides for transgenic animals that carry the transgene in
all their cells, as well as animals which carry the transgene in some, but not all their cells,
i.e., mosaic or chimeric animals. The transgene may be integrated as a single transgene
or as multiple copies such as in concatamers, e.g., head-to-head tandems or head-to-tail
tandems. The transgene may also be selectively introduced into and activated in a
particular cell type by following, for example, the teaching of Lasko et al. (Lasko et al.,
Proc. Natl. Acad. Sci. USA 89: 6232-6236 (1992)). The regulatory sequences required for
such a cell-type specific activation will depend upon the particular cell type of interest,
and will be apparent to those of skill in the art. When it is desired that the polynucleotide
transgene be integrated into the chromosomal site of the endogenous gene, gene targeting
is preferred. Briefly, when such a technique is to be utilized, vectors containing some
nucleotide sequences homologous to the endogenous gene are designed for the purpose of
integrating, via homologous recombination with chromosomal sequences, into and
disrupting the function of the nucleotide sequence of the endogenous gene. The transgene
may also be selectively introduced into a particular cell type, thus inactivating the
endogenous gene in only that cell type, by following, for example, the teaching of Gu et
al. (Gu et al., Science 265: 103-106 (1994)). The regulatory sequences required for such a
cell-type specific inactivation will depend upon the particular cell type of interest, and will
be apparent to those of skill in the art.

Once transgenic animals have been generated, the expression of the recombinant
gene may be assayed utilizing standard techniques. Initial screening may be accomplished
by Southern blot analysis or PCR techniques to analyze animal tissues to verify that
integration of the transgene has taken place. The level of mRNA expression of the
transgene in the tissues of the transgenic animals may also be assessed using techniques
which include, but are not limited to, Northern blot analysis of tissue samples obtained
from the animal, in situ hybridization analysis, and reverse transcriptase-PCR (rt-PCR).
Samples of transgenic gene-expressing tissue may also be evaluated
immunocytochemically or immunohistochemically using antibodies specific for the
transgene product.

Once the founder animals are produced, they may be bred, inbred, outbred, or
crossbred to produce colonies of the particular animal. Examples of such breeding
strategies include, but are not limited to: outbreeding of founder animals with more than
one integration site in order to establish separate lines; inbreeding of separate lines in
order to produce compound transgenics that express the transgene at higher levels because
of the effects of additive expression of each transgene; crossing of heterozygous
transgenic animals to produce animals homozygous for a given integration site in order to
both augment expression and eliminate the need for screening of animals by DNA
analysis; crossing of separate homozygous lines to produce compound heterozygous or
homozygous lines; and breeding to place the transgene on a distinct background that is
appropriate for an experimental model of interest.

Transgenic animals of the invention have uses which include, but are not limited
to, animal model systems useful in elaborating the biological function of polypeptides of
the present invention, studying conditions and/or disorders associated with aberrant
expression, and in screening for compounds effective in ameliorating such conditions
and/or disorders.
Example 14: Knock-Out Animals

Endogenous gene expression can also be reduced by inactivating or "knocking out"
the gene and/or its promoter using targeted homologous recombination (e.g., see
Smithies et al., Nature 317: 230-234 (1985); Thomas & Capecchi, Cell 51: 503-512 (1987);
Thompson et al., Cell 56: 313-321 (1989)). Alternatively, RNAi technology may be used.
For example, a mutant, non-functional polynucleotide of the invention (or a completely
unrelated DNA sequence) flanked by DNA homologous to the endogenous polynucleotide
sequence (either the coding regions or regulatory regions of the gene) can be used, with or
without a selectable marker and/or a negative selectable marker, to transfect cells that
express polypeptides of the invention in vivo. In another embodiment, techniques known
in the art are used to generate knockouts in cells that contain, but do not express, the gene
of interest. Insertion of the DNA construct, via targeted homologous recombination,
results in inactivation of the targeted gene. Such approaches are particularly suited in
research and agricultural fields where modifications to embryonic stem cells can be used
to generate animal offspring with an inactive targeted gene (e.g., see Thomas & Capecchi
1987 and Thompson 1989, supra). However, this approach can be routinely adapted for
use in humans provided the recombinant DNA constructs are directly administered or
targeted to the required site in vivo using appropriate viral vectors that will be apparent to
those of skill in the art.

In further embodiments of the invention, cells that are genetically engineered to
express the polypeptides of the invention, or alternatively, that are genetically engineered
not to express the polypeptides of the invention (e.g., knockouts) are administered to a
patient in vivo. Such cells may be obtained from the patient (i.e., animal, including
human) or an MHC compatible donor and can include, but are not limited to, fibroblasts,
bone marrow cells, blood cells (e.g., lymphocytes), adipocytes, muscle cells, endothelial
cells, etc. The cells are genetically engineered in vitro using recombinant DNA techniques
to introduce the coding sequence of polypeptides of the invention into the cells, or
alternatively, to disrupt the coding sequence and/or endogenous regulatory sequence
associated with the polypeptides of the invention, e.g., by transduction (using viral
vectors, and preferably vectors that integrate the transgene into the cell genome) or
transfection procedures, including, but not limited to, the use of plasmids, cosmids, YACs,
naked DNA, electroporation, liposomes, etc.
The coding sequence of the polypeptides of the invention can be placed under the
control of a strong constitutive or inducible promoter or promoter/enhancer to achieve
expression, and preferably secretion, of the polypeptides of the invention. The engineered
cells which express and preferably secrete the polypeptides of the invention can be
introduced into the patient systemically, e.g., in the circulation, or intraperitoneally.

Alternatively, the cells can be incorporated into a matrix and implanted in the
body, e.g., genetically engineered fibroblasts can be implanted as part of a skin graft;
genetically engineered endothelial cells can be implanted as part of a lymphatic or
vascular graft. (See, for example, Anderson et al., U.S. Patent No. 5,399,349; and
Mulligan & Wilson, U.S. Patent No. 5,460,959, the contents of which are hereby
incorporated by reference herein in their entirety).

When the cells to be administered are non-autologous or non-MHC compatible
cells, they can be administered using well known techniques which prevent the
development of a host immune response against the introduced cells. For example, the
cells may be introduced in an encapsulated form which, while allowing for an exchange of
components with the immediate extracellular environment, does not allow the introduced
cells to be recognized by the host immune system.

Transgenic and "knock-out" animals of the invention have uses which include, but
are not limited to, animal model systems useful in elaborating the biological function of
polypeptides of the present invention, studying conditions and/or disorders associated with
aberrant expression, and in screening for compounds effective in ameliorating such
conditions and/or disorders.

While preferred illustrative embodiments of the present invention are described,
one skilled in the art will appreciate that the present invention can be practiced by other
than the described embodiments, which are presented for purposes of illustration only and
not by way of limitation. The present invention is limited only by the claims that follow.
We claim:

1. An isolated nucleic acid molecule comprising:

(a) a nucleic acid molecule comprising a nucleic acid sequence that encodes an
amino acid sequence of SEQ ID NO: 95-248;

(b) a nucleic acid molecule comprising a nucleic acid sequence of SEQ ID NO:
1-94;

(c) a nucleic acid molecule that selectively hybridizes to the nucleic acid
molecule of (a) or (b); or

(d) a nucleic acid molecule having at least 95% sequence identity to the nucleic
acid molecule of (a) or (b).

2. The nucleic acid molecule according to claim 1, wherein the nucleic acid molecule
is a cDNA.

3. The nucleic acid molecule according to claim 1, wherein the nucleic acid molecule
is genomic DNA.

4. The nucleic acid molecule according to claim 1, wherein the nucleic acid molecule
is an RNA.

5. The nucleic acid molecule according to claim 1, wherein the nucleic acid molecule
is a mammalian nucleic acid molecule.

6. The nucleic acid molecule according to claim 5, wherein the nucleic acid molecule
is a human nucleic acid molecule.

7. A method for determining the presence of a colon specific nucleic acid (CSNA) in
a sample, comprising the steps of:

(a) contacting the sample with the nucleic acid molecule of SEQ ID NO: 1-94
under conditions in which the nucleic acid molecule will selectively hybridize to a
colon specific nucleic acid; and

(b) detecting hybridization of the nucleic acid molecule to a CSNA in the
sample, wherein the detection of the hybridization indicates the presence of a
CSNA in the sample.

8. A vector comprising the nucleic acid molecule of claim 1.

9. A host cell comprising the vector according to claim 8.

10. A method for producing a polypeptide encoded by the nucleic acid molecule
according to claim 1, comprising the steps of:

(a) providing a host cell comprising the nucleic acid molecule operably linked
to one or more expression control sequences, and

(b) incubating the host cell under conditions in which the polypeptide is
produced.

11. A polypeptide encoded by the nucleic acid molecule according to claim 1.

12. An isolated polypeptide selected from the group consisting of:

(a) a polypeptide comprising an amino acid sequence with at least 95%
sequence identity to SEQ ID NO: 95-248; or

(b) a polypeptide comprising an amino acid sequence encoded by a nucleic
acid molecule having at least 95% sequence identity to a nucleic acid molecule
comprising a nucleic acid sequence of SEQ ID NO: 1-94.

13. An antibody or fragment thereof that specifically binds to:

(a) a polypeptide comprising an amino acid sequence with at least 95%
sequence identity to SEQ ID NO: 95-248; or

(b) a polypeptide comprising an amino acid sequence encoded by a nucleic
acid molecule having at least 95% sequence identity to a nucleic acid molecule
comprising a nucleic acid sequence of SEQ ID NO: 1-94.

14. A method for determining the presence of a colon specific protein in a sample,
comprising the steps of:

(a) contacting the sample with a suitable reagent under conditions in which the
reagent will selectively interact with the colon specific protein comprising an
amino acid sequence with at least 95% sequence identity to SEQ ID NO: 95-248; and

(b) detecting the interaction of the reagent with a colon specific protein in the
sample, wherein the detection of binding indicates the presence of a colon specific
protein in the sample.

15. A method for diagnosing or monitoring the presence and metastases of colon
cancer in a patient, comprising the steps of:

(a) determining an amount of:

(i) a nucleic acid molecule comprising a nucleic acid sequence that
encodes an amino acid sequence of SEQ ID NO: 95-248;

(ii) a nucleic acid molecule comprising a nucleic acid sequence of SEQ
ID NO: 1-94;

(iii) a nucleic acid molecule that selectively hybridizes to the nucleic
acid molecule of (i) or (ii);

(iv) a nucleic acid molecule having at least 95% sequence identity to the
nucleic acid molecule of (i) or (ii);

(v) a polypeptide comprising an amino acid sequence with at least 95%
sequence identity to SEQ ID NO: 95-248; or

(vi) a polypeptide comprising an amino acid sequence encoded by a
nucleic acid molecule having at least 95% sequence identity to a nucleic
acid molecule comprising a nucleic acid sequence of SEQ ID NO: 1-94;
and

(b) comparing the amount of the determined nucleic acid molecule or the
polypeptide in the sample of the patient to the amount of the colon specific marker
in a normal control; wherein a difference in the amount of the nucleic acid
molecule or the polypeptide in the sample compared to the amount of the nucleic
acid molecule or the polypeptide in the normal control is associated with the
presence of colon cancer.
16. A kit for detecting a risk of cancer or presence of cancer in a patient, said kit
comprising a means for determining the presence of:

(a) a nucleic acid molecule comprising a nucleic acid sequence that encodes an
amino acid sequence of SEQ ID NO: 95-248;

(b) a nucleic acid molecule comprising a nucleic acid sequence of SEQ ID NO:
1-94;

(c) a nucleic acid molecule that selectively hybridizes to the nucleic acid
molecule of (a) or (b); or

(d) a nucleic acid molecule having at least 95% sequence identity to the nucleic
acid molecule of (a) or (b); or

(e) a polypeptide comprising an amino acid sequence with at least 95%
sequence identity to SEQ ID NO: 95-248; or

(f) a polypeptide comprising an amino acid sequence encoded by a nucleic
acid molecule having at least 95% sequence identity to a nucleic acid molecule
comprising a nucleic acid sequence of SEQ ID NO: 1-94.

17. A method of treating a patient with colon cancer, comprising the step of
administering a composition consisting of:

(a) a nucleic acid molecule comprising a nucleic acid sequence that encodes an
amino acid sequence of SEQ ID NO: 95-248;

(b) a nucleic acid molecule comprising a nucleic acid sequence of SEQ ID NO:
1-94;

(c) a nucleic acid molecule that selectively hybridizes to the nucleic acid
molecule of (a) or (b);

(d) a nucleic acid molecule having at least 95% sequence identity to the nucleic
acid molecule of (a) or (b);

(e) a polypeptide comprising an amino acid sequence with at least 95%
sequence identity to SEQ ID NO: 95-248; or

(f) a polypeptide comprising an amino acid sequence encoded by a nucleic
acid molecule having at least 95% sequence identity to a nucleic acid molecule
comprising a nucleic acid sequence of SEQ ID NO: 1-94;

to a patient in need thereof, wherein said administration induces an immune response
against the colon cancer cell expressing the nucleic acid molecule or polypeptide.
18. A vaccine comprising the polypeptide or the nucleic acid encoding the polypeptide
of claim 12.
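
The "at least 95% sequence identity" threshold recited in the claims above can be illustrated with the sketch below. It is not the method of identity determination defined by this specification: it assumes the two sequences have already been aligned to equal length with "-" as the gap character, counts identical non-gap columns, and divides by the number of aligned columns. The function names and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two pre-aligned, equal-length sequences.

    Columns in which both sequences carry a gap are ignored; a column
    counts as a match only when both residues are identical and not gaps.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == gap and y == gap)
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a, aligned_b, threshold=95.0):
    """True when the percent identity is at or above the threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    reference = "acgt" * 5               # toy 20-nucleotide sequence
    one_change = "t" + reference[1:]     # 19 of 20 positions identical
    print(percent_identity(reference, one_change))
    print(meets_identity_threshold(reference, one_change))
```

In practice identity is computed from an alignment produced by a dedicated alignment program, and the result depends on that program's parameters; the sketch covers only the final counting step.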
<160> 254

<170> PatentIn version 3.1

<210> 1 

<211> 2309 

<212> DNA 

<213> Homo sapien 

<400> 1 

catccgggtg gccacagagc aacttcctct agagggagct gattggagcc gggtgccgct 60
ggcacctcta tgatcactgg agtctcgcgg gtccctcggg ctgcacaggg acaagtaaag 120
gctacatcca gatgccggga atgcactgac gcccattcct ggaaactggg ctcccactca 180
gcccctggga gcagcagccg ccagcccctc gggacctcca tctccaccct gctgagccac 240
ccgggttggg ccaggatccc ggcaggctga tcccgtcctc cactgagacc tgaaaaatgg 300
cttcggggca aggcccaggt cctcccaggc aggagtgcgg agagcctgcc ctgccctctg 360
cttctgagga gcaggtagcc caggacacag aggaggtttt ccgcagctac gttttttacc 420
gccatcagca ggaacaggag gctgaagggg tggctgcccc tgccgaccca gagatggtca 480
ccttacctct gcaacctagc agcaccatgg ggcaggtggg acggcagctc gccatcatcg 540
gggacgacat caaccgacgc tatgactcag agttccagac catgttgcag cacctgcagc 600
ccacggcaga gaatgcctat gagtacttca ccaagattgc caccagcctg tttgagagtg 660
gcatcaattg gggccgtgtg gtggctcttc tgggcttcgg ctaccgtctg gccctacacg 720
tctaccagca tggcctgact ggcttcctag gccaggtgac ccgcttcgtg gtcgacttca 780
tgctgcatca ctgcattgcc cggtggattg cacagagggg tggctgggtg agtatccaag 840
gactgcaatg tcctccctgc tgttggggct gcccctctcc caggccccac cccttcctcg 900
ggttcctata tcttctgcta gcctctctgt gaccaccttg tttctcccgg caggtggcag 960
ccctgaactt gggcaatggt cccatcctga acgtgctggt ggttctgggt gtggttctgt 1020
tgggccagtt tgtggtacga agattcttca aatcatgact cccaagggtg ccctttgggg 1080
tcccggttca gacccctgcc tggacttaag cgaagtcttt gccttctctg ctcccttgca 1140
ggggtccccc ctcaagagta cagaagcttt agcaagtgtg cactccagct tcggagggcc 1200
cctgcgtggg ggccagtcag gctgcagagg cacctcaaca ttgcatggtg ctagtgggcc 1260
ctctctctgg gcccaggggc tgtggccgtc tcctccctca gctctctggg acctccttag 1320
ccctgtctgc taggcgctgg ggagactgat aacttgggga ggcaagagac tgggagccac 1380
ttctccccag aaagtgttta acggttttag ctttttataa tacccttgtg agagcccatt 1440
cccaccattc tacctgaggc caggacgtct ggggtgtggg gattggtggg tctatgttcc 1500
ccaggattca gctattctgg aagatcagca ccctaagaga tgggactagg acctgagcct 1560
ggtcctggcc gtccctaagc atgtgtccca ggagcaggac ctactaggag aggggggcca 1620
aggtcctgct caactctacc cctgctccca ttcctccctc cggccatact gcctttgcag 1680
ttggactctc agggattctg ggcttggggt gtggggtggg gtggagtcgc agaccagagc 1740
tgtctgaact cacgtgtcag aagcctccaa gcctgcctcc caaggtcctc tcagttctct 1800
cccttcctct ctccttatag acacttgctc ccaacccatt cactacaggt gaaggctctc 1860
acccccatcc ctgggggcct tgggtgagtg gcctgctaag gctcctcctt gcccagacta 1920
cagggcttag gacttggttt gttatttcag ggaaaaggag tagggagttc atctggaggg 1980
ttctaagtgg gagaaggact atcaacacca ctaggaatcc cagaggtggg atcctccctc 2040
atggctctgg cacagtgtaa tccaggggtg tagatggggg aactgtgaat acttgaactc 2100
tgttccccca ccctccatgc tcctcacctg tctaggtctc ctcagggtgg ggggtgacag 2160
tgccttctct attgggcaca gcctagggtc ttgggggtca ggggggagaa gttcttgatt 2220
cagccaaatg cagggagggg aggcagatgg agcccatagg ccacccccta tcctctgagt 2280
gtttggaaat aaactgtgca atcccctca 2309




<210> 2 

<211> 2880 

<212> DNA 

<213> Homo sapien 

<400> 2 

tttttttcat ttttcttttt tgtagaggca gggtcttacc atgttgccca ggctggtctt 60 

gaactcctag cctcaaagta attatcccac cttggcctcc aaaagtgttg ggattgcagg 120 

catgagccac tgtttccggt catgttacat ttttaagtag atttctcaaa acaccatgag 180 

atacgtctta cttctcatag gccagaactt agtcacaagg tcacatctag ctatgaggga 240 

ctgggcagct gtgagccata aaaggcagaa gatggcagaa ggggtattgg ataggcaact 300 

ggcagtcttt gctatagaca tccttgcagg gctagggaat tgggtaatgt cccccacagt 360 
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gcctttctat ctgaacatgt cattctttcc atatttggga acctgagaat tcgggttaat 420 

tgcccaccat aaatatcaag gactgtcatg gaatctagtg ctttctggca aagtcagcct 480 

catatgtaat tcagtaaacc tttttgagta cttagtattt gaaaagctgt gagctaggtg 540 

ctttgggggg tatgagaatg agtacagtat cctattttgg tctagaggat catagctcag 600 

gaagagaagg tgggaaaagc tgtagtggct gattttaaag actgtccgtg tacgctagac 660 

agtttaactt agcccctgtc ctagttcctg gaaggtcatg gaattgggaa aaagaattta 720 

cttgggccgg acgtggtggc tcacgcctgt aatcccagca ctttgggagg ccgaggcggg 780 

aggatcacct gaggtcagga gttcgagacc agcctgacca gtatggagaa accccatctc 840 

tactaaaaac acaaaattag ctgggtatgg tggcgcatgc ctgtaatccc agctactcag 900 

gaggctgagg caggagaatc gcttgaacct gggaggcaga ggttgcggtg ggccgagatg 960 

gcgccattgc actccagcct gggcaacaag cgaaactcca tctcaaaaaa aaaaaaaatt 1020 

tactcatgtt ttttaaattt ggctgaaaga atcaggattc ctgaaccttg gatcacacct 1080 

ccagatttgc aagagaaaat ccacattttt gcccaaaaat gtctattctt gacggagagt 1140 

ctaaagcagt tcacagaaaa aatgcagtca gatatggaga aaatccaaga attaagagag 1200 

gctcagttat actcagtgga cgtgactctg gacccagaca cggcctaccc cagcctgatc 1260 

ctctctgata atctgcggca agtgcggtac agttacctcc aacaggacct gcctgacaac 1320 

cccgagaggt tcaatctgtt tccctgtgtc ttgggctctc catgcttcat cgccgggaga 1380 

cattattggg aggtagaggt gggagataaa gccaagtgga ccataggtgt ctgtgaagac 1440 

tcagtgtgca gaaaaggtgg agtaacctca gccccccaga atggattctg ggcagtgtct 1500 

ttgtggtatg ggaaagaata ttgggctctt acctccccaa tgactgccct acccctgcgg 1560 

accccgctcc agcgggtggg gattttcttg gactatgatg ctggtgaggt ctccttctac 1620 

aacgtgacag agaggtgtca caccttcact ttctctcatg ctaccttttg tgggcctgtc 1680 

cggccctact tcagtctgag ttactcggga gggaaaagtg cagctcctct gatcatctgc 1740 

cccatgagtg ggatagatgg gttttctggc catgttggga atcatggtca ttccatggag 1800 

acctcccctt gaggaggtga attcaggcca aaagggctgt tggctgtaat cctacgccag 1860 

gcacaaggca tcttgttgcc ttgccacgtc ctgtcacagc tgggtatcct taccatgttc 1920 

cacgcccttg cagtgggaga caggatgtcc atgttctcta ccatcctttt ccttcccatg 1980 

cagattgtga aatgtaatga gatgtatcaa gatatcctag aaataaaaac cagatgtcca 2040 

cctccagtgt ttcatacttt ctggttttac acatcgctgg agggataaag agtatggata 2100 

atctttggat ttggagagcc gttcaagata cttccagctt cttggctcag cctggcttcc 2160 

tctggttcag ccccacataa tgattatggc tatttgctgt catttctggg ctagggctcc 2220 

tttctaacaa cctagactgg aataaggccc tgtcagcatg gctcccttta tcccagtttt 2280 

ccgtctggga acagtacctc tgcccctgat tcccaatgtg ccatagtttt attaactcca 2340 

ttaaagaagc ctgtatgtgt tttggttagt tacagttatt ttacaataat ggtgggtaat 2400 

ggccccacct ctgttatgag ataatgttct aatcaatgtc tctgcctttg tatcttttct 2460 

gagggctttg tctgttctct tcattctaat gaaaggtgta ttctagtgct gggtgcatat 2520 

catccaggat aatattctgc ccaactccat cctctgttac tagatccctt accagtcaca 2580 

tttgtggact ggtggccagt cgtataccat ccctggaagg attctgggac aatattccag 2640 

ggattcattg acttcttggc tccttttctc catttccttt gggggaaggg ggaattgacc 2700 

atgcttaagt gcatcctatc aaggggcagc tccgtcccca tggccattgg atcatgagac 2760 

actctgaagt cagaaggctg gggcagatca cttcaagcaa gcccccatga tggttctcag 2820 

tcctgcttct ctgtgggtac gtgcccctct gtttaaaaat aaactgaata tggatgttta 2880 



<210> 3 

<211> 478 

<212> DNA 

<213> Homo sapien 

<400> 3 

tactatatac ggtctaaggt agcgaagatc tgcgggctcg tggagctcct ggatgagatg 60 

aagttctcgc tggagaagct gcaccaaggc atcacagtct cagaccctcc ctttgacacc 120 

cagccccggc ccgatgacag cttttcctga ggaccccggc cacgcagctg ttcccccaca 180 

tggacagatg gacacacaga gcctcggcgg ccactgctgg cacggtgtga gcgccaggca 240 

tctcccaccc gcccctcccg acggcccaac caggggctgt gcagacgtgg ggaccacgga 300 

accgagatgc actttagacc agggagctgg cccggcctct ggcaggcccc ccactaactt 360 

attttgcccg gctgaggttg tggggggcgc ctcctggggt gcacgattcc ctcagctctg 420 

ggtttaatgt attatattta tttggggccg acagtgcccc aataaagggt cagaagtg 478 

<210> 4 

<211> 4076 

<212> DNA 

<213> Homo sapien 

<400> 4 

gtccaggagc ttctggccag ggcatagtag ttgctaggga cccttttcta ggtactaggg 60 

gagggggctc cccaacctgc gggcaggcag ctgcctccac gatgtttggt tttaataaag 120 

tctctttgtt tccttctctg tctctctgcc cctctgcttt cccttctctg tccagtccct 180 

ctctccccgg cttctccctc cttctaggcc tctgcggccc ctccccgcct tccctcccgg 240 

gctgggggcg gcgccgtgac gcggcggggc gggggaagga agggggtgtc gctgacgccg 300 

cggcggcctc cggcggctcc ggccttttgt gcgggcggtt gggtcgggtg ggggcggcgg 360 

ccgcggaagg ccaggccggt gccctgcggg gacgcccagc gcagcccagc cccgcgcagc 420 

ccagccctgc cctgccctgc cctgccctgc cctgcgcccg gggcgcgccc accgcgccgc 480 

atccatgttc gacaccacac cccactctgg ccggagcacg ccaagcagct ccccatcgct 540 

ccggaaacgg ctgcagctcc tgcccccaag ccggccccca cctgagccag aaccaggcac 600 

catggtggag aagggatcag atagctcctc agagaagggt ggggtgcctg ggacccccag 660 

cacccagagc ctaggcagcc ggaacttcat ccgcaacagc aagaagatgc agagctggta 720 

cagtatgctg agccccactt ataagcagcg taatgaggac ttccggaaac tgttcagcaa 780 

actccccgaa gcagaacgcc tcattgtgga ttactcctgc gccctgcagc gtgagatcct 840 

gctccagggc cgcctctacc tctctgagaa ctggatctgc ttctacagca acatcttccg 900 

ctgggagacc acgatctcca tccagctgaa ggaagtgaca tgtctgaaga aggaaaagac 960 

ggccaagctg atccccaacg ccatccagat ctgcacggag agcgagaagc atttcttcac 1020 

ttcctttggg gcccgtgacc gctgcttcct cctcatcttc cgcctctggc agaatgcact 1080 

gcttgaaaag acgctgagtc cccgcgagct ctggcacctg gtgcatcagt gctacggctc 1140 

agagctgggc ctcaccagtg aggatgagga ctatgtctcc cccttgcagc tgaacggtct 1200 

ggggaccccc aaggaagtgg gagatgtgat cgccctgagc gacatcacct cctcgggggc 1260 

agctgaccgc agccaggagc caagcccagt gggttcgcgc cgtggccatg tcacgcccaa 1320 

cctttcccga gccagcagcg acgcagacca tggggcagag gaggacaagg aggagcaggt 1380 

agacagccag ccagacgcct cctccagcca gacagtgacc ccggtggctg aacccccgag 1440 

cacagagccc acccagcctg acgggcccac caccctgggc cccttggatc tgctgcccag 1500 

tgaggagcta ttgacagaca caagtaactc ctcttcatcc actggggagg aaggtgaggc 1560 

aggcgggccc aattcattcg cctccggtac ttgcaagcct cgctcagtct taagcaagag 1620 

gggatggatt cgcccgcagc actgagaatc caggggcagg cgggatggcg ttcaggcgct 1680 

gttgctagaa atctctgtct ttactctgtt ttgaaggcag catggcaggg tgaacacaag 1740 

cacagactga aggcagcttg ccgcggttca catcctggtc acaccacttc ctgaccatgt 1800 

cacgtgggca aattatgtaa cttctctgag cttgctttcc tcctctgaaa atggggttaa 1860 

caatgataat acccacttca taggattgtt gttaagagac tataaatgag gctgggtgtg 1920 

atggctcaca cctgtaatcc cagcactttg ggaggccaag gtaggaagga ttgctagagg 1980 

ccaagagttg gagaacagcc tgggcagcaa agcaagaccc cgttttgtac aaaaaagaaa 2040 

aaaaagaaaa aactaggtgt ggtggtgcac acctatagta ccagctactt ggcagaccaa 2100 

ggcaagagga tcacatgagg ccaagagttt gagacaagcc tgggcaacat agtgagaccc 2160 

ctgtctttac aaaaaattta aaaattagcc aggcatgggg tcatgcaccc tatgtagtct 2220 

cagctactgg ggaggctgag gcaggaggat cgcttgagcc caggagttgg aggctgcagt 2280 

gagctatgat tgcaccactg cacttcggcc tgagcaacag agcgagaccc tgtctctaaa 2340 

acgaacaaag ggagagaatt aagtgatttc atacatttaa agcacttggg ccagttagta 2400 

agcattgact attagtattt ctattatcat tattgtattc attttctctc tccactaatt 2460 

ttcagctagc ctctattctt gtctcttatg gggcagagat ggcctcagcc ctgccccaga 2520 

ccagcccaat gagagagagc agcctatttt tcctagtgtc tacagcggac accctagggt 2580 

tggctctgat tgggccaact tgagagtgtg cctgcccctg aaccaatcac tgtggttgat 2640 

cggatagacc tgggccaccc actcacccct gaaccaattg ctgtgcccct aggttgatgc 2700 

tcccaggagg cgtgctatct ggtaatgaat ccctgtgatt taaatctcag ggcctaggtg 2760 

tgttctccac ccagaaccaa gcacagtggc tgtctgtctg ggcctggagc acgtgcccac 2820 

cctcaaccca tccctgagtc acaggagggc tctgtgtcag tgtccagctg tgctccagga 2880 

ggtggggtcc gcccatgcac actacaaggc ctgggagtgg ggaggcgtgg cccgcagagg 2940 

gagggaggag tggaggcagt catcatcctc ggctctcctc tgtctccagc gggacttggc 3000 

cgccctgctt cccgacctct ccggccgcct cctcatcaac tctgtcttcc atgtgggcgc 3060 

agtgccctct gagatccgct accgaaagca gccgtggagc ctggtgaagt cgctcattga 3120 

gaagaactcg tggagcggca ttgaagacta tttccaccat ctggagcgag agctcgccaa 3180 

ggctgagaag ctgtctctgg aggaaggcgg gaaggatgcc cggggcttgc tatccggcct 3240 

gcggcggcgg aagcggcccc tgagctggcg ggctcacggg gacgggcccc agcacccaga 3300 

tcctgacccc tgtgcccggg ccggcattca cacctcgggc tccctcagct cccgcttctc 3360 

cgaaccatct gtggaccagg gccccggggc aggcatcccc agtgccctgg ttctcatcag 3420 

cactgtccct atcatcctca tcgccctcaa cgtcctgctc ttctaccgcc tctggtccct 3480 

ggaaaggaca gcccacacct ttgagtcctg gcacagcctg gccctggcca agggcaagtt 3540 

cccccagacg gccacagagt gggccgagat cctggcgctg cagaagcaat tccacagcgt 3600 

ggaggtgcac aagtggaggc agatcctgcg ggcctccgtg gagctcctgg atgagatgaa 3660 

gttctcgctg gagaagctgc accaaggcat cacagtctca gaccctccct ttgacaccca 3720 

gccccggccc gatgacagct tttcctgagg accccggcca cgcagctgtt cccccacatg 3780 

gacagatgga cacacagagc ctcggcggcc actgctggca cggtgtgagc gccaggcatc 3840 

tcccacccgc ccctcccgac ggcccaacca ggggctgtgc agacgtgggg accacggaac 3900 






cgagatgcac tttagaccag ggagctggcc cggcctctgg caggcccccc actaacttat 3960 
tttgcccggc tgaggttgtg gggggcgcct cctggggtgc acgattccct cagctctggg 4020 
tttaatgtat tatatttatt tggggccgac agtgccccaa taaagggtca gaagtg 4076 



<210> 5 

<211> 2578 

<212> DNA 

<213> Homo sapien 

<400> 5 



gtccaggagc ttctggccag ggcatagtag ttgctaggga cccttttcta ggtactaggg 60 
gagggggctc cccaacctgc gggcaggcag ctgcctccac gatgtttggt tttaataaag 120 
tctctttgtt tccttctctg tctctctgcc cctctgcttt cccttctctg tccagtccct 180 
ctctccccgg cttctccctc cttctaggcc tctgcggccc ctccccgcct tccctcccgg 240 
gctgggggcg gcgccgtgac gcggcggggc gggggaagga agggggtgtc gctgacgccg 300 
cggcggcctc cggcggctcc ggccttttgt gcgggcggtt gggtcgggtg ggggcggcgg 360 
ccgcggaagg ccaggccggt gccctgcggg gacgcccagc gcagcccagc cccgcgcagc 420 
ccagccctgc cctgccctgc cctgccctgc cctgcgcccg gggcgcgccc accgcgccgc 480 
atccatgttc gacaccacac cccactctgg ccggagcacg ccaagcagct ccccatcgct 540 
ccggaaacgg ctgcagctcc tgcccccaag ccggccccca cctgagccag aaccaggcac 600 
catggtggag aagggatcag atagctcctc agagaagggt ggggtgcctg ggacccccag 660 
cacccagagc ctaggcagcc ggaacttcat ccgcaacagc aagaagatgc agagctggta 720 
cagtatgctg agccccactt ataagcagcg taatgaggac ttccggaaac tgttcagcaa 780 
actccccgaa gcagaacgcc tcattgtgga ttactcctgc gccctgcagc gtgagatcct 840 
gctccagggc cgcctctacc tctctgagaa ctggatctgc ttctacagca acatcttccg 900 
ctgggagacc acgatctcca tccagctgaa ggaagtgaca tgtctgaaga aggaaaagac 960 
ggccaagctg atccccaacg ccatccagat ctgcacggag agcgagaagc atttcttcac 1020 
ttcctttggg gcccgtgacc gctgcttcct cctcatcttc cgcctctggc agaatgcact 1080 
gcttgaaaag acgctgagtc cccgcgagct ctggcacctg gtgcatcagt gctacggctc 1140 
agagctgggc ctcaccagtg aggatgagga ctatgtctcc cccttgcagc tgaacggtct 1200 
ggggaccccc aaggaagtgg gagatgtgat cgccctgagc gacatcacct cctcgggggc 1260 
agctgaccgc agccaggagc caagcccagt gggttcgcgc cgtggccatg tcacgcccaa 1320 
cctttcccga gccagcagcg acgcagacca tggggcagag gaggacaagg aggagcaggt 1380 
agacagccag ccagacgcct cctccagcca gacagtgacc ccggtggctg aacccccgag 1440 

cacagagccc acccagcctg acgggcccac caccctgggc cccttggatc tgctgcccag 1500 

tgaggagcta ttgacagaca caagtaactc ctcttcatcc actggggagg aaggacttgg 1560 

ctgccctgct tcccgacctc tccggccgcc tcctcatcaa ctctgtcttc catgtgggcg 1620 

ctgagcggct ccagcagatg ctcttctcgg actcgccctt cctccagggc ttcctacagc 1680 

agtgcaagtt cacagacgtg accctgagcc cctggagtgg ggacagcaag tgccaccagc 1740 

gccgggtgct gacgtacacc atccccatca gcaacccact gggccccaag agcgcctccg 1800 

tggtggagac acagacgctg ttccggcgcg gcccccaggc cggcgggtgt gtggtggact 1860 

ccgaggtgct gacgcagggc atcccctacc aggactactt ctacactgcc caccgctact 1920 

gcatcctggg tctggcccgg aacaaggcgc ggctccgagt gtcttctgag atccgctacc 1980 

gaaagcagcc gtggagcctg gtgaagtcgc tcattgagaa gaactcgtgg agcggcattg 2040 

aagactattt ccaccatctg gagcgagagc tcgccaaggc tgagaagctg tctctggagg 2100 

aaggcgggaa ggatgcccgg ggcttgctat ccggcctgcg gcggcggaag cggcccctga 2160 

gctggcgggc tcacggggac gggccccagc acccagatcc tgacccctgt gcccgggccg 2220 

gcattcacac ctcgggctcc ctcagctccc gcttctccga accatctgtg gaccagggcc 2280 

ccggggcagg catccccagt gccctggttc tcatcagcat tgtccttatc atcctcatcg 2340 

ccctcaacgt cctgctcttc taccgcctct ggtccctgga aaggacagcc cacacctttg 2400 

agtcctggca cagcctggcc ctggccaagg gccggcctct ggcaggcccc ccactaactt 2460 

attttgcccg gctgaggttg tggggggcgc ctcctggggt gcacgattcc ctcagctctg 2520 

ggtttaatgt attatattta tttggggccg acagtgcccc aataaagggt cagaagtg 2578 

<210> 6 

<211> 480 

<212> DNA 

<213> Homo sapien 

<400> 6 

cggccgcccg ggcaggtaga aaagcagcct cctagtcaga caagccagcc gagaagaagg 60 

aagatgaaag ccaaatggaa gatcctagtg tgcggtgcag ttttcaactg acctctggac 120 

gcagaacttc agccatgaag gtaacaggca tctttcttct cagtgccttg gccctgttga 180 

gtctatctgg taacactgga gctgactccc tgggaagaga ggccaaatgt tacaatgaac 240 

ttaatggatg caccaagata tatgaccctg tctgtgggac tgatggaaat acttatccca 300 

atgaatgcgt gttatgtttt gaaaatcgga aacgccagac ttctatcctc attcaaaaat 360 

ctgggccttg ctgagaacca aggttttgaa atcccatcag gtcaccgcga ggcctgactg 420 

gccttattgt tgaataaatg tatctgaata tccccctgtt gtttccattt gctttttcct 480 

<210> 7 

<211> 2803 

<212> DNA 

<213> Homo sapien 

<400> 7 

gcccttgatg tcaaagtctc ccacgaagga ggcaggagcg ttttgcccca gtccaagtac 60 

tctgtccgcc accacatccc ctaccccagt aacggggagc tgcattctca gtatcacggc 120 

tactacgtga agctgaacgc cccccagcac cctccagtag acgtggagga cggcgatgga 180 

tccagccagt cttcctcagc actggtgcac aagccatccg ctaacaagtg gagtccctcc 240 

aaatctgtga caaaaccagt tgccaaagaa agcaaaygct gagccaaaag ctaagaagtc 300 

tgaacttgct ataccaaagw aatccagcaa gcaacgattc atgccctgct ttggaaaaag 360 

aagccaattc aggccctaat tcaatcatga ttgtccttgt catgctgttg aatatcgggt 420 

tggccattct ttttgttcac tttctaactt gattggaatt aggaaagtga agtcctgaca 480 

actagtggtg tgttagccca cagttctctg cagtggtgcc gtcagccctt taaatggccc 540 

accccaggca gactcagaca cagggaagga ggcttggaat taggatggga tcgcgagaga 600 

gatgaaactt ggaaaattca gccagaggac ctgttcggtt cccctgggag tgctttgtgg 660 

ctttggggtc ctctgggagc tgaagcacac ggccatgttg gagagcaaaa cttaataata 720 

attctttttt aaaatctgct gaagcagccc catccggcga agttcttggc attggctact 780 

ccatctgtcc tagcagagtg tgactaaact ggaaatgtat ggagctgcat tttttttttt 840 

gatggaagtc aacagctgcc ttactatttt accatctgac gtttttagta gggagctggg 900 

gaaacgactg cccgaggaat gcggtcggag gattgtgttg ctgtggacgt ggttccaaca 960 

ttcactccta tctcattaga agaaataggt agcagcatca ctcaccagtc ttatgccatg 1020 

acctgctgct ttaacatctc ctggaatgtt ctgggagaga acgtttcgtt tgcttgtgca 1080 

cgactctgtt catttttgct gtttatttaa atctgatgta tagtcatcta ccacctgcgg 1140 

tcctggccgg acgaaacaca cgtgttgcat ctcaaggaaa agaaacaaca aaatgctagt 1200 

gaaaatgtgc ttattttgat agcaatatac catcttttat gtcatttatt cttcctatac 1260 

ttgtaaaagg ttatttatct ttaaaattta gcaatttgaa gacactatgc cttcctaaga 1320 

actaatttaa ttctaaatat ttttaacatt actgaggtga attatttatg cacttaggaa 1380 

gtgctaaatt ttaaaagctg aaacacaaca gaattctaag aaatatagtc caaacgttgc 1440 

atggattgca gtaatcagtg tttaaaggat tcagtttctt tgctgacgta ctttacaacc 1500 

aaataaaatc ttgtcggtgg ctgtgttaat tcccatgaaa gttaagcaag atgctattaa 1560 

taaactgctc tgctctttct tgttttcttt ttccaactta aatttctgtt gaatacattc 1620 

aggtagaaca taaagccttg ttcaatcact gcctctcagt tttctgcctt tcctgttttt 1680 

caaagtcctt ttgtaatttc atttcacatt ttaaaggctt agacattttt attttaaata 1740 

tgtgtctttt acagtctttt gtcattctga catttctgga tttttgctgt tttataattt 1800 

accctttgtt attcagaagc atgcttactt atagaaacta aatggtcttt ataaaagtaa 1860 

ttacttaaaa agaaatcagg ggaagaaaga tatctatcta atctattaaa tctttataaa 1920 

acattacatt gcagaggggg agctactcct aaatattttc atgatttgca tggtttaatc 1980 

agattttttt tttttttaca ccatattagc taccttttca atggagaaga gacagttcac 2040 

acaattccct ggttagcaca gatgtggact gagtgctttg tcacctgcag ggtagtaacc 2100 

cagtgatgtt tcttacagaa gcacaatatg ttgaaaatcc tgggtgtgac caatatggaa 2160 

taaagaagaa ggcagaaaga gagcaaatga aaaatttcaa cttgtatatt cattttttac 2220 

attttgcttt gacttttaaa tttaggaagt ccgtttttac ctgagaacaa atgtttaaag 2280 

ttcctgcgtc actctcagta ctctcactgc ccctcccaaa ccctatagct ccttacgctg 2340 

ggaaagctgg ttttttaaaa aaataataat aaaatattta atcttattaa gtgttcattt 2400 

aaaatgcgta atgctttgga aataatgggt aacagatagc gagaggatat gtttataaag 2460 

tgagcatgtt ggtcccattt ataaatatat gtatgattta taagcttttt taaaacaaag 2520 

ctcaaattgt tggtattttt ctaaaatgtg cacagctgta ttttacatga aggctctttc 2580 

taatgggttg ttatactgta ctcaacattt tggacagcac atgaagtctg ccaatgtact 2640 

taataaaaca tgactttgtt tatttaaagt ttcttgctgt gaaaaagaac tccctacctg 2700 

tgagttcctt tatttataat tcttgaaacc aaaatgtata atgtacagtt ttcacaactg 2760 

tatctgctct aataaaaaaa agttggttat taaaaaaaaa aaa 2803 

<210> 8 

<211> 891 

<212> DNA 

<213> Homo sapien 

<400> 8 

gattctctta atttccaccg ctacagatgt acatgtacat ccagacaaca acacttactg 60 

tctccatgag tttaagtgct tcagtatctc tgggcatgct ctatatgccc aaggtttata 120 

ttataatttt tcatccagaa cagaatgttc aaaaacgcaa gaggagcttc aaggctgtgg 180 

tgacagctgc caccatgcaa agcaaactga tccaaaaagg aaatgacaga ccaaatggcg 240 

aggtgaaaag tgaactctgt gagagtcttg aaaccaacac ttcctctacc aagacaacat 300 

atatcagtta cagcaatcat tcaatctgaa acagggaaat ggcacaatct gaagagatgt 360 

ggtatatgat cttaaatgat gaacatgaga ccgcaaaaat tcactcctgg agatctccgt 420 

agactacaat caatcaaatc aatagtcagt cttgtaagga acaaaaatta gccatgagcc 480 
aaaagtatca ataaacgggg agtgaagaaa cccgttttat acaataaaac caatgagtgt 540 
caagctaaag tattgcttat tcatgagcag ttaaaacaaa tcacaaaagg aaaactaatg 600 
ttagctcgtg aaaaaaaatg ctgttgaaat aaataatgtc tgatgttatt cttgtatttt 660 
tctgtgattg tgagaactcc cgttcctgtc ccacattgtt taacttgtat aagacaatga 720 
gtctgtttct tgtaatggct gaccagattg aagccctggg ttgtgctaaa aataaatgca 780 
atgattgatg catgcaattt tttatacaaa taatttattt ctaataataa aggtcgcagg 840 
cgaccagatc tttctatcca catttcctat caatacctcc acaattatct a 891 



<210> 9 

<211> 3733 

<212> DNA 

<213> Homo sapien 

<400> 9 

gtgtgctgga aaggcgtgcc cccgctgggc ttcttcaccg cgccctctgc ggggagcagg 60 

gaataattct gctacaaggc tgatttcaag gacatgaatt gttgacctca tcccaacatc 120 

agaacctcag atgttctaat ttttgcacca ttccaggcaa gttgatctta taaggaaata 180 

aaattgaacc ttaggggtct gatggaaatt cactgtgaca ttcaaatcaa gaaaacttgc 240 

taatgcccac agagcctttt ccccatgggc cctgatggta gcctccagaa ggtgcagcct 300 

caggtggtgc cctttctgct gtgttgcaag aataaacttt gggtcttgga ttgcaatacc 360 

acctgtggag aaaatggtat gcgagggaaa gcgatcagcc tcttgccctt gtttcttcct 420 

cttgaccgcc aagttctact ggatcctcac aatgatgcaa agaactcaca gccaggagta 480 

tgcccattcc atacgggtgg atggggacat tattttgggg ggtctcttcc ctgtccacgc 540 

aaagggagag agaggggtgc cttgtgggga gctgaagaag gaaaagggga ttcacagact 600 

ggaggccatg ctttatgcaa ttgaccagat taacaaggac cctgatctcc tttccaacat 660 

cactctgggt gtccgcatcc tcgacacgtg ctctagggac acctatgctt tggagcagtc 720 

tctaacattc gtgcaggcat taatagagaa agatgcttcg gatgtgaagt gtgctaatgg 780 

agatccaccc attttcacca agcccgacaa gatttctggc gtcataggtg ctgcagcaag 840 

ctccgtgtcc atcatggttg ctaacatttt aagacttttt aagatacctc aaatcagcta 900 

tgcatccaca gccccagagc taagtgataa caccaggtat gactttttct ctcgagtggt 960 

tccgcctgac tcctaccaag cccaagccat ggtggacatc gtgacagcac tgggatggaa 1020 

ttatgtttcg acactggctt ctgaggggaa ctatggtgag agcggtgtgg aggccttcac 1080 

ccagatctcg agggagattg gtggtgtttg cattgctcag tcacagaaaa tcccacgtga 1140 

accaagacct ggagaatttg aaaaaattat caaacgcctg ctagaaacac ctaatgctcg 1200 

agcagtgatt atgtttgcca atgaggatga catcaggagg atattggaag cagcaaaaaa 1260 

actaaaccaa agtgggcatt ttctctggat tggctcagat agttggggat ccaaaatagc 1320 

acctgtctat cagcaagagg agattgcaga aggggctgtg acaattttgc ccaaacgagc 1380 

atcaattgat ggatttgatc gatactttag aagccgaact cttgccaata atcgaagaaa 1440 

tgtgtggttt gcagaattct gggaggagaa ttttggctgc aagttaggat cacatgggaa 1500 

aaggaacagt catataaaga aatgcacagg gctggagcga attgctcggg attcatctta 1560 

tgaacaggaa ggaaaggtcc aatttgtaat tgatgctgta tattccatgg cttacgccct 1620 

gcacaatatg cacaaagatc tctgccctgg atacattggc ctttgtccac gaatgagtac 1680 

cattgatggg aaagagctac ttggttatat tcgggctgta aattttaatg gttgccgaag 1740 

agggatccag atgtctctac cctggccaac tctttttact ccttcatttt ccagtagttg 1800 

ggcagtgctg gcactcctgt cacttttaat gaaaacggag atgctcctgg acgttatgat 1860 

atcttccagt atcaaataac caacaaaagc acagagtaca aagtcatcgg ccactggacc 1920 

aatcagcttc atctaaaagt ggaagacatg cagtgggctc atagagaaca tactcacccg 1980 

gcgtctgtct gcagcctgcc gtgtaagcca ggggagagga agaaaacggt gaaaggggtc 2040 

ccttgctgct ggcactgtga acgctgtgaa ggttacaact accaggtgga tgagctgtcc 2100 

tgtgaacttt gccctctgga tcagagaccc aacatgaacc gcacaggctg ccagcttatc 2160 

cccatcatca aattggagtg gcattctccc tgggctgtgg tgcctgtgtt tgttgcaata 2220 

ttgggaatca tcgccaccac ctttgtgatc gtgacctttg tccgctataa tgacacacct 2280 

atcgtgaggg cttcaggacg cgaacttagt tacgtgctcc taacggggat ttttctctgt 2340 

tattcaatca cgtttttaat gattgcagca ccagatacaa tcatatgctc cttccgacgg 2400 

gtcttcctag gacttggcat gtgtttcagc tatgcagccc ttctgaccaa aacaaaccgt 2460 

atccaccgaa tatttgagca ggggaagaaa tctgtcacag cgcccaagtt cattagtcca 2520 

gcatctcagc tggtgatcac cttcagcctc atctccgtcc agctccttgg agtgtttgtc 2580 

tggtttgttg tggatccccc ccacatcatc attgactatg gagagcagcg gacactagat 2640 

ccagagaagg ccaggggagt gctcaagtgt gacatttctg atctctcact catttgttca 2700 

cttggataca gtatcctctt gatggtcact tgtactgttt atgccattaa aacgagaggt 2760 

gtcccagaga ctttcaatga agccaaacct attggattta ccatgtatac cacctgcatc 2820 

atttggttag ctttcatccc catctttttt ggtacagccc agtcagcaga aaagatgtac 2880 

atccagacaa caacacttac tgtctccatg agtttaagtg cttcagtatc tctgggcatg 2940 

ctctatatgc ccaaggttta tattataatt tttcatccag aacagaatgt tcaaaaacgc 3000 

aagaggagct tcaaggctgt ggtgacagct gccaccatgc aaagcaaact gatccaaaaa 3060 

ggaaatgaca gaccaaatgg cgaggtgaaa agtgaactct gtgagagtct tgaaaccaac 3120 

acttcctcta ccaagacaac atatatcagt tacagcaatc attcaatctg aaacagggaa 3180 

atggcacaat ctgaagagat gtggtatatg atcttaaatg atgaacatga gaccgcaaaa 3240 

attcactcct ggagatctcc gtagactaca atcaatcaaa tcaatagtca gtcttgtaag 3300 

gaacaaaaat tagccatgag ccaaaagtat caataaacgg ggagtgaaga aacccgtttt 3360 

atacaataaa accaatgagt gtcaagctaa agtattgctt attcatgagc agttaaaaca 3420 
aatcacaaaa ggaaaactaa tgttagctcg tgaaaaaaaa tgctgttgaa ataaataatg 3480 
tctgatgtta ttcttgtatt tttctgtgat tgtgagaact cccgttcctg tcccacattg 3540 

tttaacttgt ataagacaat gagtctgttt cttgtaatgg ctgaccagat tgaagccctg 3600 

ggttgtgcta aaaataaatg caatgattga tgcatgcaat tttttataca aataatttat 3660 

ttctaataat aaaggtcgca ggcgaccaga tctttctatc cacatttcct atcaatacct 3720 
ccacaattat cta 3733 



<210> 10 

<211> 1375 

<212> DNA 

<213> Homo sapien 

<400> 10 

gtgtgctgga aaggcgtgcc cccgctgggc ttcttcaccg cgccctctgc ggggagcagg 60 

gaataattct gctacaaggc tgatttcaag gacatgaatt gttgacctca tcccaacatc 120 

agaacctcag atgttctaat ttttgcacca ttccaggcaa gttgatctta taaggaaata 180 

aaattgaacc ttaggggtct gatggaaatt cactgtgaca ttcaaatcaa gaaaacttgc 240 

taatgcccac agagcctttt ccccatgggc cctgatggta gcctccagaa ggtgcagcct 300 

caggtggtgc cctttctgct gtgttgcaag aataaacttt gggtcttgga ttgcaatacc 360 

acctgtggag aaaatggtat gcgagggaaa gcgatcagcc tcttgccctt gtttcttcct 420 

cttgaccgcc aagttctact ggatcctcac aatgatgcaa agaactcaca gccaggagta 480 

tgcccattcc atacgggtgg atggggacat tattttgggg ggtctcttcc ctgtccacgc 540 

aaagggagag agaggggtgc cttgtgggga gctgaagaag gaaaagggga ttcacagact 600 

ggaggccatg ctttatgcaa ttgaccagat taacaaggac cctgatctcc tttccaacat 660 

cactctgggt gtccgcatcc tcgacacgtg ctctagggac acctatgctt tggagcagtc 720 

tctaacattc gtgcaggcat taatagagaa agatgcttcg gatgtgaagt gtgctaatgg 780 

agatccaccc attttcacca agcccgacaa gatttctggc gtcataggtg ctgcagcaag 840 

ctccgtgtcc atcatggttg ctaacatttt aagacttttt aagatacctc aaatcagcta 900 

tgcatccaca gccccagagc taagtgataa caccaggtat gactttttct ctcgagtggt 960 

tccgcctgac tcctaccaag cccaagccat ggtggacatc gtgacagcac tgggatggaa 1020 

ttatgtttcg acactggctt ctgaggggaa ctatggtgag agcggtgtgg aggccttcac 1080 

ccagatctcg agggagattg gttatccatc cctgtttggg attcagggct gccttcatga 1140 

atgttttgct atactatgtc aagttgttta ccaatttctg cttctaatgc agctgtcaga 1200 

tgctcaaact gtctattagt cgaagaggct ctctactgac cacctttctc aagtttcttc 1260 

ttcttacata caatgttttt gacaagacta agcatatcaa accatctttt aacaattgta 1320 

aagcctaata ttggccagca tacaattcaa ttcattaaac attgtagcac tgaaa 1375 



<210> 11 

<211> 2905 

<212> DNA 

<213> Homo sapien 

<400> 11 



gctaaagcac ccagagggac tagacagact ctggctagag gcacttgtcc tgaaatctga 60 

ttcctgcctg atgatggtgt ctcttgtatc aatctctctc tctggtcttt gaattggata 120 

gaaattataa cccagcttca tcgttctcca agtttggaaa tccttctgtg tgagtattaa 180 

gggcatcact ttgaagaaag gacaaatgtt ctcgatggtt gatttggaag agtaccttta 240 

tggtagtgtt tctcagtggg gacaatttgt cctccccttc ccactctggg ataatctggc 300 

aatgtctgga gacattctgg gttctcacaa ataggaggag gatgccactg gcatctagta 360 

ggtcagcgat gctgctaagc atcctgcagt gcccaggaca gccccacaaa gactcatcca 420 

gcctgaatgt caatagtgct gaggtcgagg agccttactt tacgggaact aggcaggagg 480 

agtctgtatg cccaaggacg ttggtcatta tcacgtgtcc acttcagtac agcactactc 540 

agaggcaatg ctttttgggt aacggatttt tctcttaaca ggttcttcat tttgaaaatg 600 

tcaaagatgt accctttgga tttcaaacag taacatcgga tgtaaacaaa cttagttcct 660 

tttactcact gaaactaatc aagcggctct acgtagacaa atctctgaat ctttctacag 720 

agtbcatcag ctctacgaag agaccctatg caaaggaatt ggaaactgtt gacttcaaag 780 

ataaattgga agaaacgaaa ggtcagatca acaactcaat taaggatctc acagatggcc 840 

actttgagaa cattttagct gacaacagtg tgaacgacca gaccaaaatc cttgtggtta 900 

atgctgccta ctttgttggc aagtggatga agaaatttcc tgaatcagaa acaaaagaat 960 

gtcctttcag agtcaacaag acagacacca aaccagtgca gatgatgaac atggaggcca 1020 

cgttctgtat gggaaacatt gacagtatca attgtaagat catagagctt ccttttcaaa 1080 

ataagcatct cagcatgttc atcctactac ccaaggatgt ggaggatgag tccacaggct 1140 

tggagaagat tgaaaaacaa ctcaactcag agtcactgtc acagtggact aatcccagca 1200 

ccatggccaa tgccaaggtc aaactctcca ttccaaaatt taaggtggaa aagatgattg 1260 

atcccaaggc ttgtctggaa aatctagggc tgaaacatat cttcagcgaa gacacatctg 1320 

atttctctgg aatgtcagag accaagggag tggccctatc aaatgttatc cacaaagtgt 1380 

gcttagaaat aactgaagat ggtggggatt ccatagaggt gccaggagca cggatcctgc 1440 

agcacaagga tgaattgaat gctgaccatc cctttattta catcatcagg cacaacaaaa 1500 

ctcgaaacat cattttcttt ggcaaattct gttctcctta agtggcatag cccatgttaa 1560 

gtcctccctg acttttctgt ggatgccgat ttctgtaaac tctgcatcca gagattcatt 1620 

ttctagatac aataaattgc taatgttgct ggatcaggaa gccgccagta cttgtcatat 1680 

gtagccttca cacagataga cctttttttt ttttccaatt ctatcttttg tttccttttt 1740 

tcccataaga caatgacata cgcttttaat gaaaaggaat cacgttagag gaaaaatatt 1800 

tattcattat ttgtcaaatt gtccggggta gttggcagaa atacagtctt ccacaaagaa 1860 

aattcctata aggaagattt ggaagctctt cttcccagca ctatgctttc cttctttggg 1920 

atagagaatg ttccagacat tctcgcttcc ctgaaagact gaagaaagtg tagtgcatgg 1980 

gacccacgaa actgccctgg ctccagtgaa acttgggcac atgctcaggc tactataggt 2040 

ccagaagtcc ttatgttaag ccctggcagg caggtgttta ttaaaattct gaattttggg 2100 

gattttcaaa agataatatt ttacatacac tgtatgttat agaacttcat ggatcagatc 2160 

tggggcagca ccctataaat caacacctta atatgctgca acaaaatgta gaatattcag 2220 

acaaaatgga tacataaaga ctaagtagcc cataaggggt caaattttgc tgccaaatgc 2280 

gtatgccacc aacttacaaa aacacttcgt tcgcagagct tttcagattg tggaatgttg 2340 

gataaggaat tatagacctc tagtagctga aatgcaagac cccaagagga agttcagatc 2400 

ttaatataaa ttcactttca tttttgatag ctgtcccatc tggtcatttg gttggcacta 2460 

gactggtggc aggggcttct agctgactcg cacagggatt ctcacaatag ccgatatcag 2520 

aatttgtgtt gaaggaactt gtctcttcat ctaatatgat agcgggaaaa ggagaggaaa 2580 

ctactgcctt tagaaaatat aagtaaagtg attaaagtgc tcacgttacc ttgacacata 2640 

gtttttcagt ctatgggttt agttacttta gatggcaagc atgtaactta tattaatagt 2700 

aatttgtaaa gttggttgga taagctatcc atgttgcagg ttcatggatt acttctctat 2760 

aaaaaatatg tatttaccaa aaaattttgt gacattcctt ctcccatctc ttccttgaca 2820 

tgcattgtaa ataggttctt cttgttctga gattcaatat tgaatttctc ctatgctatt 2880 

gacaataaaa tattattgaa ctaca 2905 

<210> 12 

<211> 6682 

<212> DNA 

<213> Homo sapien 

<400> 12 



aatttataag attcatagaa atagacaatg caaagagttc cctgaggaca ggaaccacat 60

ctgtattgtt cacttctgta acccagtgct ttgcccatat tagatgccca gaaaataaca 120

attgagttta tgaatgacct ctgatatctc tacatcctta attcagtctt atttttatcc 180

actaaaatca tgagattagt gaaattgttt aagcacgtaa ggatagtttt actgttaatt 240

tagattttac caataagcaa gagtagtact attgctggat ttatatgggt ttgatataat 300

tttataatcc aaaagaaaga ttggaaaggg atgacaagtt ggttaattca ttcttaactg 360

aaagatagtg gatagcatag cacataaaag cactggactc tggagtaaaa ctgcctgtat 420

ttcagtctcg gctatgtgat cttggtcaag ttaccttacc tctttgtgta ttagctttca 480

catgtaaaat gggaatggca gtaataacac ttccttataa ggtataatat tgtacatttt 540

gtgaggatta aatgagttaa tacaaatata acactatatt attatattga tgttgttaga 600

tgttattatt gaaattgcta aattatttga tagttttacc caaagttgag attagtcttg 660

atcacttata aataattttt atcttcatga ttaggctttt cagaattctt tagctcatca 720

ttaatgactg ttattctcat agcattggag gcgttgtact ggtaaacctg gcagggtctg 780

aaaaacctgc tggaagagac acagtaggtt ccatttggtc tcttgctgga gccatgctct 840

atgctgtcta tattgttatg attaagagaa aagtagatag agaagacaag ttggatattc 900

caatgttctt tggttttgta ggtttgttta atctgctgct cttatggcca ggtttctttt 960

tacttcatta tactggattt gaggacttcg agtttcccaa taaagtagta ttaatgtgca 1020

ttatcattaa tggccttatt ggaacagtac tctcagagtt cctgtggttg tggggctgct 1080

ttcttacctc atcattgata ggcacacttg cactaagcct tacaatacct ctgtccataa 1140

tagctgacat gtgtatgcaa aaggtgcagt tttcttggtt attttttgca ggagctatcc 1200

ctgtattttt ttcatttttt attgtaactc tcctatgcca ttataataat tgggatcctg 1260

tgatggtggg aatcagaaga atatttgctt ttatatgcag aaaacatcga attcagagag 1320

ttccagaaga cagcgaacag tgtgagagtc tcatttctat gcacagtgtt tctcaggagg 1380

atggagctag ttagctgtct gttgtctgta gcccagcttg ataatggaac tatacagcga 1440

agagacaatc tctggcaagt ttttgtagaa aaaatgtttc agtgcctagt ctgaaaaata 1500

acagtttgag ttctttgaaa ctctaaaata tatttttctc atacctgttt tcttcatttt 1560

cataatgaag cactttgcta tgtagctgtg tacatatcac tacagttata ggaagtttca 1620

gtctacagtc catccaaagg accaacctgc cttacacatc tcaaggaatt cagctgttga 1680 

aatcatttga actaatcaag gaataaatcc taatgttctg ggactttatt ttcacatgtt 1740

aaatgctgga atatattatg aaaatgtttt caagaaatca cttaagtgtt catagaccag 1800 

tatttctgac aggtaaaatg ctaaaataag ctacctgtaa taagtgtgga ttatattttt 1860 

gggttttgta gaatattgca aattaaccac acaaaaaatg tttaatttat gcaacaagca 1920 

tgtttgtgca aatttcatgg gactttaaaa agaataagta tttgagaaaa tatctggttc 1980 

acttacacta catttactgt attattcttt tatagcatta ggtgccttgt attttaaatc 2040 

tgtgacaaac catggcaaat ttttaaaggg gaagtattat tataaaatga agaaatatgt 2100 

atttctaaag gctatattgc tgtaaactta attgataaag ctctgtttaa tttagagttt 2160 

tgaagaaata gtctcccttc aattaagaaa ttttcataat ggaatgattt aaattgaagt 2220 

gacaaagagt attattaaaa tacaatgttt atacgtctat ttgtgtattg tagatgtatc 2280 

aagtgatttc taattttttt cacatatgaa tgtgccagat tactctagaa ctagatgtct 2340 

cttctttaaa taattttagt tttcctgaat aaatttgtaa tggttaaagt accaagtaag 2400 

taaggcgaga agggattctg tttttaaaat cacatcagaa cttttcctct actaagatta 2460 

taaattaaat gtaaaatact cctaattgca attcttaaac ttaggcctta catgtactta 2520 

ttatgcaact gctcctggac tctattcacc atagatatca gtaaacgtat gtcccaggat 2580 

tcacaggctt ttgattaatc aatattcttt tcaagtttgc tgtgaagagt ttagttctct 2640 

tcaaaatttc ttaactaatc tgatttttaa gaattctctt tgcagtgttt agcttcctat 2700 

tcacattctt aaaattgctt tggtgttacc atgagtctaa aatgaagttt agccttcctt 2760 

ttgtttcatt ctgagaactt ctatattata ttacctttaa aaattgttta tgatattaaa 2820 

tttaaaatac aaacagctct cttttttttt tttttttttt ttaatcatcc agcccaaagt 2880 

ggcaaaaaca gctcttttct catttggcac caccaataac gcaagttaaa aataatgttg 2940 

agtttattat acttttgacc tgtttagctc aacagggtga aggcatgtaa agaatgtgga 3000 

cttctgagga attttctttt aaaaagaaca taatgaagta acattttaat tactcaagga 3060 

ctacttttgg ttgaagttta taatctagat acctctactt tttgtttttg ctgttcgaca 3120 

gttcacaaag accttcagca atttacaggg taaaatcgtt gaagtagtgg aggtgaaact 3180 

gaaatttaaa attattctgt aaatactata gggaaagagg ctgagcttag aatcttttgg 3240 

ttgttcatgt gttctgtgct cttatcatca cacaggtcat gtgttgttac tcaggattct 3300 

ggaagtacta agccgtagtt aacaggctgg atagatcttc agccaatctt ctttcattat 3360 

actgctgctt ttctgtttct taaaaaacaa aaatcaatga acaacttctt tagaaggaag 3420 
caccactgtt caattggtta actgaaagta tgaatcactg ccgctttgct tgcccacctg 3480

tttgcttttt ctcacaggtt ttatttatct gaatgattgt tgcatttgaa tactgtagcc 3540 

atggtgagtg agcagttgaa gacttccttt ctgcaccttc atagttaagg gttcatcttc 3600 

tcaagaaata aagttgtgct gggttgtttc aattctgtag atcacctaga aactaggtaa 3660 

caaggccctt ggatcacaat gagcacattg caggtctttg gaaatgctgg gaaggggtta 3720 

cattcaagta actgctgacg tccttcctct ctgtcagacc catgttggcc tgtaattata 3780 

tttttctgaa tctaaaaaca aatagaaatt tctgatatct tttcagtctc cttcttttct 3840 

ttccattcat cacttaaatc tcattattgt ataacgtttc aaattatgac ctggattgaa 3900

ggaagtctgt tttgtcaggg actttgttag gtgaacatca gaatctcaac tacaacaaag 3960 

ctgaaaaatg aggcagcctt aacagtaaat gctggctaaa accaattctt tagattgttt 4020 

ttgccaccat ttgcttactt ggaatagctt tcacatttat ttttcttatg aagaaatcca 4080 

gtgtcagtca gaccatgtaa catcaaagca aaatttcatt gaagcctatc cttgtcttaa 4140 

ttattctaaa ataaagtaca ataaagtgaa ttattttgcc ctttgtagta gcatagtgtt 4200 

aagttatata tatattttaa tatgcatact cctaagtaaa gtttcagaga ccacatgctg 4260 

gagagcagtg gctttcagac agcatcagca tcacctgaag gccttgttga atacacatta 4320 

ttggaagaga ggggagaaca caggctcatg cctgtaattc cagtgctttg ggagactgag 4380 

gcgggaggat cacgtgagcc caggagttca agatcaggct gggcaataca gcaagacccc 4440 

tgtctctaca aaaacaaaaa caaaaaacaa acaggcatgg tagtgcatac ctgtagtctg 4500 

agctgcttgg gagtctgagg caggaggatc acttgaaccc aggagttcga ggctgcagtg 4560

agttattatt gggccactgc agtccagcct gggcagcaaa gtgagaccct gtctgttaaa 4620 

aaaacaaaac caaaaacttg attgctgggg gccaccctag agtttctgat tcagtaggtc 4680 

tggcggtggg gatggtgggc ctggtaattt gcctttctaa caagcgtcca ggtaatgccc 4740

cagctgctgg tcccaggacc acactttgca gattattact ctgagaagag cactaactct 4800 

gacgttggat tccttctgtt cctcattagt gaatttattt gggggagatt attcaatctg 4860 

gtagatccgc atttacttag ctgtgaagtg gaggttaatg caacctgcct gagactgtga 4920 

gaatgagaaa ctctaattga aataccaggc ccagcgtgcc tggatgcttg aaaaatgctg 4980 

actcccatgg tccgtccttc cctcactctt ctcagctata aagtgaggag cgaaattcag 5040

tatcttttaa gattcctctc aggtgtaaaa tgctgtgatt ctaaaagttt aattttgtag 5100 

ttccagagac aatgccaaac acctgtgtgt gcttagtgct ttgtaaagag tggtaataac 5160 

agtgacacca cataggccgt agttttgtag ttccagagac aatgccaaac acctgtgtgt 5220 
acttagtgct ttgtaaagag tgaataacag tgacaccaca taggccatag ttttgtagtt 5280

ccagagacaa tgccaaacac ctatgtgtac ttagtgcttt gtaaagagtg gtaataacag 5340

tgacaccaca taggccgtaa ttttgttttt gcttttgtta gggtgttcag catcttgact 5400

ccttaaataa ggttttcagt cagatttata aagcagcatt gaagaagttg tattatattt 5460

gagtacagtt ggaatccatg gcagttttat ggaggccaaa tatatcagat gaataggcta 5520

atttccctga gttgctggcc ttattagtcc ctactctgtt gccccctggt tacctaatgc 5580

atgttcaaaa ggtatagttt tcatcacctg cttgatcttg attttcagac agagcacttt 5640

gggagtccga ggcgggtgga tcacctgaga ttaggagttc aagaccagtg tgaccaacat 5700

ggagaaaccc cttctctact aaaaatacaa aaaattagcc aggcatggtg gtgcacgcct 5760

gtggtcccag ctacttgtga ggctgaggta ggagaatcgc ttgaacccaa gaggcagagg 5820

ttgcagtggg ctgagatcac accactgcac tccagcctgg gtgacagagc gagaccctgt 5880

ttcaaaaaaa taagaaataa aaagtgagtc ttcatattca aattagtata tgaggttagg 5940

caacaattct tatatgggac caggtatatt taatttgcct ttctttgcag tcatcctagt 6000

aaatccttgt aaagggttat tttatataca attttgtagt atattctaaa tcccctttgg 6060

caaataaagc tgaagttatc tagtgtcagc atactgtgat ttgtatattt atataagaca 6120

agattgcatc attctgatta agttgcagca ttttagtttt atctgatagg cctgtattga 6180

gtaatcagta gctatgtgca aatattaagt caacttaatt accagaccca gaaggccaga 6240

ggtaatcaaa gagtactttc ggaagatgaa atatttataa tgagttatgg aaacaatgtt 6300

ttagaaatgt ttttcttggt gtaatttaat gcagattttt acctcaattt tgttaaaaaa 6360

cttgataatt tgatccctcc atctatttaa tatttaatat tttcagattc tagagtattt 6420

aaaatttaaa cacaattata aaattaatta taattcctac ttatgttagc tatgatttag 6480

ggccaaaact accctggttt tgtcttgcaa cttgaaacat tgtgtatttt aaactatata 6540

atttgggaac attattttat agactgtgga taaccaaata taagatttgt gatttttaat 6600

gcttcatttg tgtaaaatca tttgtatatt ctatttatag tatgcatttt ctagctcatt 6660

aaaagtacag aaaaacctca aa 6682

<210> 13

<211> 2992

<212> DNA

<213> Homo sapien

<400> 13

ggcggaggtt tagccactcc ctcgtccttt gctctgcccc agacacccac gctcgctttc 60

ctcttcttcc tccacctcgc cctacttccc ttcttcctat cttctctctt ctttcgcctc 120

gccgccgcga gccctcgctc ggcgcccaga gcctcgcatt acttccccta gctccgccaa 180

gcctcgctgc atacgcgccc gcgcgcagtc cgccgtcgcg cccctatgac gtcacccgcg 240 

cgccaccgct tccgggctgg ggactggggg cggggccgcg cggagggcaa gccgggggcg 300 

gggccggggc cagagcagat ctccgggcct ccgcggccat agctgactgt gccgtccctt 360 

cccctcaccc gctccacgcc ctcctgggcc gagtggagtt gggtggtgtc gggagcctct 420 

ccctgagggg caccgcgtct tcaggagctg ggcctccagt gcggcgcgat gtcaggcgcg 480 

gtgacagctc tgtgagtccg aggccgcggc cgtggcgctg ggcggctgcg gggcctgacc 540 

ggtccgctca tggtgccgcc acgacgccat cgcggggcag gaaggccagg ggtgctgagt 600 

tcttcacctc cttttagact gagatctgcc aagttttccg gcattgctct tgaggatctc 660 

agaagggctc ttaagacaag actgcaaatg gtgtgtgtat ttgtcatgaa ccgaatgaat 720 

tcccagaaca gtggtttcac tcagcgcagg cgaatggctc ttgggattgt tattcttctg 780 

cttgttgatg tgatatgggt tgcttcctct gaacttactt cgtatgtttt tacccagtac 840 

aacaaaccat tcttcagcac ctttgcaaaa acatctatgt ttgttttgta ccttttgggc 900 

tttattattt ggaagccatg gagacaacag tgtacaagag gacttcgcgg aaagcatgct 960 

gctttttttg cagatgctga aggttacttt gctgcttgca caacagatac aactatgaat 1020 

agttctttga gtgaacctct gtatgtgcct gtgaaattcc atgatcttcc aagtgaaaaa 1080 

cctgagagca caaacattga tactgaaaaa acccccaaaa agtctcgtgt gaggttcagt 1140 

aatatcatgg agattcgaca gcttccgtca agtcatgcat tggaagcaaa gttgtctcgc 1200 

atgtcatatc ctgtgaaaga acaagaatcc atactgaaaa ctgtggggaa acttactgca 1260 

actcaagtag cgaaaattag cttttttttt tgctttgtgt ggtttttggc aaatttgtca 1320 

tatcaagaag cactttcaga cacacaagtt gctatagtta atattttatc ttcaacttcc 1380 

ggacttttta ccttaatcct tgctgcagta tttccaagta acagtggaga tagatttacc 1440 

ctttctaaac tattagctgt aattttaagc attggaggcg ttgtactggt aaacctggca 1500 

gggtctgaaa aacctgctgg aagagacaca gtaggttcca tttggtctct tgctggagcc 1560 

atgctctatg ctgtctatat tgttatgatt aagagaaaag tagatagaga agacaagttg 1620 

gatattccaa tgttctttgg ttttgtaggt ttgtttaatc tgctgctctt atggccaggt 1680 

ttctttttac ttcattatac tggatttgag gacttcgagt ttcccaataa agtagtatta 1740 

atgtgcatta tcattaatgg ccttattgga acagtactct cagagttcct gtggttgtgg 1800 

ggctgctttc ttacctcatc attgataggc acacttgcac taagccttac aatacctctg 1860 

tccataatag ctgacatgtg tatgcaaaag gtgcagtttt cttggttatt ttttgcagga 1920 

gctatccctg tatttttttc attttttatt gtaactctcc tatgccatta taataattgg 1980 
gatcctgtga tggtgggaat cagaagaata tttgctttta tatgcagaaa acatcgaatt 2040

cagagatgga atcttgctct gtcgcccaag ctggagtgca gtggcgagat ctcgggtcac 2100

tgcaacctct acctcctggg ctcaagcgat tctcctgcct cagcctcccg agtagctggg 2160

actacaggcg cacgccacca cacccagcta atttttgtat ttttagtaga gacggggttt 2220

caccatgttg gtcaggatgg tctcgatctc ttgaactcgt gatccgccct ccttggcttc 2280

ccaaagtgct gggattatag gcgtgagcca ccgcgcccgg cccacagcat catttttcaa 2340

gggtctctag ggtgaccagg tgtccgggtt tgccaaggac agtcccagtt tctacctgct 2400

gtcctaggac aatcatgaat agtgcccctt tcagtttcaa aagtgtcata gtttagacat 2460

taaattacct gaccaccctt atatgcccca tgtttcttaa catctaggct gctgacaaaa 2520

ttcacatttt ctgactgtca gtctagtgtt ctctatacta gactgatacg gatccacaaa 2580

ggaagcaata ttggccaaca attaatcttc agtctttgga ttagtgagag gtgacaacgt 2640

gctggcagcc gtcacagccc tcgctcactc tccccgcctc ctctgcctgg tttcccagtt 2700

tgacggcact tgaggagccc ttcagcccgc cgctgcactg tgggagcccc ttcctgggct 2760

ggccgaggcc ggaggcggct cccccagctt gcggggaggt gtggagggag aggcgagggc 2820

gggaacaggg gctgcgcgcg gtgcttgcgg gccagcgcga gttctgggtg ggtgtgggct 2880

cggcgggccc cgcactcgga gcggccggca ggccctgccg gccccgggca atgaggggct 2940

tagcacccgg gccagcggct gcggagggtg tgccgggtcc cccagcagtg cc 2992

<210> 14

<211> 717

<212> DNA

<213> Homo sapien

<400> 14

aatcactata ggcgcattgg tgctctagat gctgctcgag cggcgcagtg tgatggatgt 60

ggtcgcggcc gaggtgtctg tttgacttta cgcccatctc aggacacttc cgtagactgt 120

ttaggttccc ctgtcaaata tcagttaccc actcggtccc agttttgttg ccccagaaag 180

ggatgttatt atccttgggg gctcccaggg caagggttaa ggcctgaatc atgagcctgc 240

tggaagccca gcccctactg ctgtgaaccc tggggcctga ctgctcagaa cttgctgctg 300

tcttgttgcg gatggatgga aggttggatg gatgggtgga tggccgtgga tggccgtgga 360

tgcgcagtgc cttgcatacc caaaccaggt gggagcgttt tgttgagcat gacagcctgc 420

agcaggaata tatgtgtgcc tatttgtgtg gacaaaaata tttacactta gggtttggag 480

ctattcaaga ggaaatgtca cagaagcagc taaaccaagg actgagcacc ctctggattc 540

tgaatctcaa gatgggggca gggctgtgct tgaaggccct gctgagtcat ctgttagggc 600

cttggttcaa taaagcactg agcaagttga gaaaaacaac aaccacaaca caaaataaaa 660

gctctttttt ttcttctttt ttgggcaaaa aggccctttt ttttcgggaa agggttt 717 

<210> 15 

<211> 440 

<212> DNA 

<213> Homo sapien 

<400> 15 

aatcactata ggcgcattgg tgctctagat gctgctcgag cggcgcagtg tgatggatgt 60 

ggtcgcggcc gaggtctggg gcctgactgc tcagaacttg ctgctgtctt gttgcggatg 120 

gatggaaggt tggatggatg ggtggatggc cgtggatggc cgtggatgcg cagtgccttg 180 

catacccaaa ccaggtggga gcgttttgtt gagcatgaca gcctgcagca ggaatatatg 240 

tgtgcctatt tgtgtggaca aaaatattta cacttagggt ttggagctat tcaagaggaa 300

atgtcacaga agcagctaaa ccaaggactg agcaccctct ggattctgaa tctcaagatg 360 

ggggcagggc tgtgcttgaa ggccctgctg agtcatctgt tagggccttg gttcaataaa 420 

gcactgagca agttgagaaa 440 

<210> 16 

<211> 4387 

<212> DNA 

<213> Homo sapien 

<400> 16 

ggcacgagga agacaaaaaa aaaataagcg taacttccct caaagcaaca accccccccc 60 

ccacccctcg cccgctgcct ccctcggccc agccagctgt gccggcgttt gttggctgcc 120 

ctgcgcccgg ccctccagcc agccttctgc cggccccgcc gcgatggagg tgccccagcc 180 

ggagcccgcg ccaggctcgg ctctcagtcc agcaggcgtg tgcggtggcg cccagcgtcc 240 

gggccacctc ccgggcctcc tgctgggatc tcatggcctc ctggggtccc cggtgcgggc 300

ggccgcttcc tcgccggtca ccaccctcac ccagaccatg cacgacctcg ccgggctcgg 360

caggtaggac accccaagga ggctgcatat gggggtgaga ggctaggtct ggagtcctgg 420 

gcctcctgga gaactgggca tggtagtcga atccaggagg tggagtccgg ggaggcatcc 480 

tgagcagtgg agaggtacgc ccttgttccc tcctaggggc ccaggacatg ggaagtgaga 540 

cctgggctgt gccccccaca cctccaggtg acctgctctc agtctgcaca agcctctcca 600 

aatcttcttt cccctcactc cctcctccct cattgtgcct ccaccacgct gcctccgtaa 660 

tctctgcccg gatagccacc attttgcccc aggatgggaa tctcaagcct ggagagggaa 720 

cttaagggtg gcagagccgt ggagcactca cacatccatg cccctgctcc ctccacccca 780 
gttctgcccc cagaattaag acaattaatt aaacactgat cctgctccct aagtctgcaa 840
ggggagagct gggttgttcc agtaggagaa cctggtcttg gtccatcctt caggaagggt 900 
cctggaaagg tgggggtctg acagtttagg ggaaggctct ctggggcggc tctgctagac 960 
caccaaccct taggctggcc tccagccagg cagccttgct cacaccctgg gctcaccttg 1020 
ccctgctcct gagaggacct ttcctaaacc agtcaacgaa gagaagacgc tttagggcct 1080 
ctgtagaatg aaccttcctt gacctgagaa gagggggccc tcagggccat tgctttcccg 1140 
gtgtagtgtt tcgctcagtt cttttctcag caagggagtg ggaacgtggg ctagccaggc 1200 
cttgagcccc acgtttacct ttctccttto ccgggtccct gttcccttcc cagcgaaacc 1260 
ccaaagagtc aggtagggac cctgctcttc cgcagccgca gccgcctgac gcacctatcc 1320 
ctgtctcgac gggcatccga atcctccctg tcgtctgaat cctccgaatc ttctgatgca 1380 
ggtctctgca tggattcccc cagccctatg gacccccaca tggcggagca gacgtttgaa 1440
caggccatcc aggcagccag ccggatcatt cgaaacgagc agtttgccat cagacgcttc 1500 
cagtctatgc cggtgaggct gctgggccac agccccgtgc ttcggaacat caccaactcc 1560 
caggcgcccg acggccggag gaagagcgag gcgggcagtg gagctgccag cagctctggg 1620 
gaagacaagg agaatgtgcg cttctggaag gccggggtgg gagctctccg ggaagaggag 1680 
ggggcatgct ggggtggttc cctggcatgt gaggaccctc ctctcccatc ttggctgcag 1740 
gatggatttg tcttcaagat gccatggaag cccacacatc ccagctccac ccatgctctg 1800 
gcagagtggg ccagccgcag ggaagccttt gcccagagac ccagctcggc ccccgacctg 1860
atgtgtctca gtcctgaccg gaagatggaa gtggaggagc tcagccccct ggccctaggt 1920 
cgcttctctc tgacccctgc agagggggat actgaggaag atgatggatt tgtggacatc 1980 
ctagagagtg acttaaaggt aaacagcctt gtcccaccag gcccctacct tcctatccct 2040 
gggttcgccc aaaagaagag agctctgagc ccttgtggca ggaccatgat gtcattccag 2100 
tgtcagaaga agaatctgag gtaactgagt cacagcctaa cctctggcca atgagagaag 2160 
aacaggtggc tggtgccaca gtggagggcg gtttgggagg taggggagct tctcacccca 2220 
acctggagtt tgccagggaa acagggaagg ctgtgctgga ggattccggg actggagggg 2280 
tttggcctat gcctgtgggt tgtgaccctg tccttgctaa tttggcctca gatgatgatg 2340 
cagttccccc aggcatggag agtctcatta gtgccccact ggtcaagacc ttggaaaagg 2400 
aagaggaaaa ggacctcgtc atgtacagca agtgccagcg gctcttccgc tctccgtcca 2460 
tgccctgcag cgtgatccgg cccatcctca agaggctgga gcggccccag gacagggaca 2520 
cgcccgtgca gaataagcgg aggcggagcg tgacccctcc tgaggagcag caggaggctg 2580 
aggaacctaa agcccgcgtc ctccgctcaa aatcactgtg tcacgatgag atcgagaacc 2640 
tcctggacag tgaccaccga gagctgattg gagattactc taaggccttc ctcctacaga 2700

cagtagacgg aaagcaccaa gacctcaagt acatctcacc agaaacgatg gtggccctat 2760 

tgacgggcaa gttcagcaac atcgtggata agtttgtgat tgtagactgc agatacccct 2820 

atgaatatga aggcgggcac atcaagactg cggtgaactt gcccctggaa cgcgacgccg 2880 

agagcttcct actgaagagc cccatcgcgc cctgtagcct ggacaagaga gtcatcctca 2940 

ttttccactg tgaattctca tctgagcgtg ggccccgcat gtgccgtttc atcagggaac 3000

gagaccgtgc tgtcaacgac taccccagcc tctactaccc tgagatgtat atcctgaaag 3060 

gcggctacaa ggagttcttc cctcagcacc cgaacttctg tgaaccccag gactaccggc 3120 

ccatgaacca cgaggccttc aaggatgagc taaagacctt ccgcctcaag actcgcagct 3180 

gggctgggga gcggagccgg cgggagctct gtagccggct gcaggaccag tgaggggcct 3240

gcgccagtcc tgctacctcc cttgcctttc gaggcctgaa gccagctgcc ctatgggcct 3300 

gccgggctga gggcctgctg gaggcctcag gtgctgtcca tgggaaagat ggtgtgggtg 3360 

tcctgcctgt ctgccccagc ccagattccc ctgtgtcatc ccatcatttt ccatatcctg 3420 

gtgcccccca cccctggaag agcccagtct gttgagttag ttaagttggg ttaataccag 3480 

cttaaaggca gtattttgtg tcctccagga gcttcttgtt tccttgttag ggttaaccct 3540 

tcatcttcct gtgtcctgaa acgctccttt gtgtgtgtgt cagctgaggc tgggggagag 3600 

ccgtggtccc tgaggatggg tcagagctaa actccttcct ggcctgagag tcagctctct 3660 

gccctgtgta cttcccgggc cagggctgcc cctaatctct gtaggaaccg tggtatgtct 3720 

gccatgttgc ccctttctct tttccccttt cctgtcccac catacgagca cctccagcct 3780 

gaacagaagc tcttactctt tcctatttca gtgttacctg tgtgcttggt ctgtttgact 3840 

ttacgcccat ctcaggacac ttccgtagac tgtttaggtt cccctgtcaa atatcagtta 3900 

cccactcggt cccagttttg ttgccccaga aagggatgtt attatccttg ggggctccca 3960 

gggcaagggt taaggcctga atcatgagcc tgctggaagc ccagccccta ctgctgtgaa 4020

ccctggggcc tgactgctca gaacttgctg ctgtcttgtt gcggatggat ggaaggttgg 4080

atggatgggt ggatggccgt ggatggccgt ggatgcgcag tgccttgcat acccaaacca 4140 

ggtgggagcg ttttgttgag catgacagcc tgcagcagga atatatgtgt gcctatttgt 4200 

gtggacaaaa atatttacac ttagggtttg gagctattca agaggaaatg tcacagaagc 4260 

agctaaacca aggactgagc accctctgga ttctgaatct caagatgggg gcagggctgt 4320 

gcttgaaggc cctgctgagt catctgttag ggccttggtt caataaagca ctgagcaagt 4380 

tgagaaa 4387 
<210> 17
<211> 2283 
<212> DNA 
<213> Homo sapien 

<400> 17 

tgcttcctgg gcgccgtggg cgcggactgc gcgggctgcg cgggtgccga ggagcgcgag 60 

gcgcggggaa ggcgcacctg gggtggccct ggcgtgcggg cggcgacatg gaggacggcg 120 

tgctcaagga gggcttcctg gtcaagaggg gccacattgt ccacaactgg aaggcgcgat 180 

ggttcatcct tcggcagaac acgctggtgt actacaagct tgaggggggt cggagagtga 240 

cccctcccaa gggccggatc ctcctggatg gctgcaccat cacctgcccc tgcctggagt 300

atgaaaaccg accgctcctc attaagctga agactcaaac atccacggag tacttcctgg 360 

aggcctgtcc tcgagaggag cgggatgcct gggcctttga gatcaccggg gctattcatg 420

cagggcagcc ggggaaggtc cagcagctgc acagcctgag aaactccttc aagctgcccc 480

cgcacatcag cctgcatcgc attgtggaca agatgcacga tagcaacacc ggaatccgtt 540

caagccccaa catggagcag ggaagcacct ataaaaagac cttcctcggc tcctccctgg 600 

tggactggct catctccaac agcttcacgg ccagccgtct ggaggcggtg accctggcct 660

ccatgctcat ggaggagaac ttcctcaggc ctgtgggtgt ccgaagcatg ggagccattc 720 

gctctgggga tctggccgag cagttcctgg atgactccac agccctgtac acttttgctg 780 

agagctacaa aaagaagata agccccaagg aagaaattag cctgagcact gtggagttaa 840 

gtggcacggt ggtgaaacaa ggctacctgg ccaagcaggg acacaagagg aaaaactgga 900 

aggtgcgtcg ctttgttcta aggaaggatc cagctttcct gcattactat gacccttcca 960

aagaagagaa caggccagtg ggtgggtttt ctcttcgtgg ttcactcgtg tctgctctgg 1020 

aagataatgg cgttcccact ggtaaggtac aacttcagcc ctggcctgaa gatctcaggg 1080 

gcctccctcc tgcacctcag taatgtccca ctaatgaaat gacagctttg ggagccgagc 1140 

agggctttct ccatgagaca gagtggtgtg agttttgaag tgaatttcag aattgaacaa 1200 

agggtatgtt cttggcgttt taagaataca actaagtaat ccttatgtta ataatctttc 1260

ccctcagttt ctgtgctgat gccccataag cttccgccta aaataatagg tgaggcttac 1320 

acaatgctct agggttaaca acataccgta atatggtatc ataatactaa tttcacaata 1380 

gccagccata gtaagaagat attattccca tcttgtgtct gtggaaactg ggggttccaa 1440 

taagttaaac aatttgagat caaactccaa attctatgct tgcttttaac ataaacatac 1500 

tggattcttt gatgtaaatg aaatggtaga tctatggcat gctaccatct tgagcatagg 1560 

gagcaggcaa gagggcttcc aggaatatcc agacataacc attggaaggc cttagggcac 1620 
tggtgaggga cagggaatgg gaggaaaagg ggacctcatc tgataggatt ctctaactgt 1680

aaacctttta tttaataaat acatcattgt atgtttttgt ccatgtctca gttgactttc 1740 

tgctggttca agatgaaagc agacatggct attcaatcac tgattttttt cccccttttt 1800 

tctgcctagg ggttaaaggg aatgtccagg gaaacctctt caaagtgatt actaaggatg 1860 

acacacacta ttacattcag gccagcagca aggctgagcg agccgagtgg attgaagcta 1920 

tcaaaaagct aacatgacaa ggacctgagg gaaccaggat tcctccctcc taccagatga 1980 

cacagacaag agttcctgga gaatgggagt gttaagactt ttgacttctt tgtaagtttt 2040 

gtactgcttt ggagagtgaa tgctgccaag agttcctcag attacaaaca gcagtggtgc 2100 

catttccttc cccatcttca tgttacaaac ctggaaaggc tagaacagcc attaggcgtc 2160 

agcatcttga cttttcccca gcatcacaaa cagccatttc ctcgggcacc aaagtaggtt 2220 

ccctttgttg gaacaattac actggccatg ccataatgtt gaataaaact ctcttcttat 2280 

2283 

<210> 18 
<211> 1046 
<212> DNA 
<213> Homo sapien 

<400> 18 

agagtccgcc tatcagccaa tgaagagaca gcagtgagag cggttgcgca gtgaaggcta 60 

gacccggttt actggaattg ctctggcgat cgaggggtcc tagtacaccg caatcatgtc 120 

tattatgtcc tataacggag gggccgtcat ggccatgaag gggaagaact gtgtggccat 180 

cgctgcagac aggcgcttcg ggatccaggc ccagatggtg accacggact tccagaagat 240 

ctttcccatg ggtgaccggc tgtacatcgg tctggccggg ctcgccactg acgtccagac 300 

agttgcccag cgcctcaagt tccggctgaa cctgtatgag ttgaaggaag gtcggcagat 360 

caaaccttat accctcatga gcatggtggc caacctcttg tatgagaaac ggtttggccc 420 

ttactacact gagccagtca ttgccgggtt ggacccgaag acctttaagc ccttcatttg 480 

ctctctagac ctcatcggct gccccatggt gactgatgac tttgtggtca gtggcacctg 540 

cgccgaacaa atgtacggaa tgtgtgagtc cctctgggag cccaacatgg tacgttggtg 600 

gcatggagag gggctgggct ctgagttacc cacccttggt cattaggaaa aaagtgtgtt 660 

tatgtggcag gtaatgggga gaatggtgca ggtgggagag aaagccacag ctgctgctcc 720 

ttggaagggt atccagaaat tccatctgcc aggctggctg gctggctcca gaggagagca 780 

tcttctgtta ggagaccaca gttagatatt tgccacagca gtaagaattc cttacattgg 840

gtttacaggg ttctcgcatc cattcactga gcaaacattt aactcagtgt agggttagag 900 
gcagctcaac aggttgtctg ggttggcaac ctggatccac ccaccacctt agttattgtt 960

cttggagaag ccagtttccc tctctgcctc cactaccttg actgtaaaat gaggataaaa 1020

aagtctgcct tagaagtggt gctgag 1046 

<210> 19 

<211> 591 

<212> DNA 

<213> Homo sapien 

<400> 19 

agagtccgcc tatcagccaa tgaagagaca gcagtgagag cggttgcgca gtgaaggcta 60 

gacccggttt actggaattg ctctggcgat cgaggggtcc tagtacaccg caatcatgtc 120 

tattatgtcc tataacggag gggccgtcat ggccatgaag gggaagaact gtgtggccat 180 

cgctgcagac aggcgcttcg ggatccaggc ccagatggtg accacggact tccagaagat 240

ctttcccatg ggtgaccggc tgtacatcgg tctggccggg ctcgccactg acgtccagac 300

agttgcccag cgcctcaagt tccggctgaa cctgtatgag ttgaaggaag gtcggcagat 360 

caaaccttat accctcatga gcatggtggc caacctcttg tatgagaaac ggtttggccc 420 

ttactacact gagccagtca ttgccgggtt ggacccgaag acctttaagc ccttcatttg 480 

ctctctagac ctcatcggct gccccatggt gactgatgac tttgtggtca gtggcacctg 540

cgccgaacaa atgtacctcg gccgcgacca cgctaagccg aattccagca c 591 

<210> 20 

<211> 1471 

<212> DNA 

<213> Homo sapien 

<400> 20 

caaaacttga ccgtaccctc cagttccccc agcctgtggg aagccccagc agctgaaccg 60 

ggttgtgggc ggcgaggaca gcactgacag cgagtggccc tggatcgtga gcatccagaa 120 

gaatgggacc caccactgcg caggttctct gctcaccagc cgctgggtga tcactgctgc 180 

ccactgtttc aaggagtatg tacagccggc ctggggcact tgatcttcta aggccctggg 240 

ctttgtgccc caggccctgg gctcccttta caggctctgc ttccgggcct ctgttcaatc 300 

ttggtgcccc tcgattctac ctagaagcct ctcctgcctt caggggcctt cacacccatg 360 

ctgttccccc aggtacctcc tcccagacca ggcctggctg atcctcccct tccctttcct 420 

tccagcaacc tgaacaaacc atacctgttc tctgtgctgc tgggggcctg gcagctgggg 480 

aaccctggct ctcggtccca gaaggtgggt gttgcctggg tggagcccca ccctgtgtat 540

tcctggaagg aaggtgcctg tgcagacatt gccctggtgc gtctcgagcg ctccatacag 600 

ttctcagagc gggtcctgcc catctgccta cctgatgcct ctatccacct ccctccaaac 660 
acccactgct ggatctcagg ctgggggagc atccaagatg gagttccctt gccccaccct 720
cagaccctgc agaagctgaa ggttcctatc atcgactcgg aagtctgcag ccatctgtac 780 
tggcggggag caggacaggg acccatcact gaggacatgc tgtgtgccgg ctacttggag 840 
ggggagcggg atgcttgtct gggcgactcc gggggccccc tcatgtgcca ggtggacggc 900 
gcctggctgc tggccggcat catcagctgg ggcgagggct gtgccgagcg caacaggccc 960 

ggggtctaca tcagcctctc tgcgcaccgc tcctgggtgg agaagatcgt gcaaggggtg 1020 

cagctccgcg ggcgcgctca ggggggtggg gccctcaggg caccgagcca gggctctggg 1080 

gccgccgcgc gctcctaggg cgcagcggga cgcggggctc ggatctgaaa ggcggccaga 1140

tccagatctg gatctggatc tgcggcggcc tcgggcggtt tcccccgccg taaataggct 1200 

catctactct acctctgggg gcccggacgg ctgctgcgga aaggaaaccc cctccccgac 1260 

ccgcccgacg gcctcaggcc ccgcctccaa ggcatcaggc cccgcccaac ggcctcatgt 1320 

ccccgccccc acgacttccg gccccgcccc cgggccccag cgcttttgtg tatataaatg 1380 

ttaatgattt ttataggtat ttgtaaccct gcccacatat cttatttatt cctccaattt 1440 

caataaatta tttattctcc agaaaaaatt t 1471 

<210> 21 

<211> 2312 

<212> DNA 

<213> Homo sapien 

<400> 21 

gcaagtgtcg gtcggggcct cgagctgcct gagctgacac gaggggaggg gtctgtgtag 60 

ccaacaggtg accgaagggc ttgcctgccc acagcttact tggccaaggg gtttctgaag 120 

ttcctgccac aacctgaggt tgtggctggg aagagacgca gagggcacag agctgagccg 180 

caggcacaac tggacaaagc cagagcccca ggcccctgtg gcctgggaaa gagtagctcc 240 

ctccaacctc ccccagggcc acccgctgcc caagtccttc tcctccccac cttctccttc 300 

gaacaagagg gaggaggagg aggaggagtt caacttcgag gtcatcccac cgccgccaga 360 

gttcagcaat gaccctgagc ccccggcccc ggccctccag tatctgggcc gccagagctc 420 

ccctccccgg aacaactact cagacttgag gcagctcccg aacgctggcc ccggggcgcc 480 

cccggctctc ggcttctcgc gctttcccgc gggcgcgcgc tacgccgggg ctgggggcct 540 

ggagcgcttc tcgggagggg gccgctcgct cataaagaag cgcctgtacg tcggggagcc 600 

gcaccgaggc ccagggctac cccacggtgg caccggccgc agcctgagct ctcccaactg 660 

cttcgggccg cagcccggag gccccgagat gcggcgcgtg aactcggcgg gtcgcgcgcc 720 

ccccggaggc ctgcacgcgc cgaggctgtc cctggagggc gccgcccggg gcgccgcgga 780 
ggccaagcac aaagcgcccg gcagcgccga ctacggcttc gccccagctg ccggcaggtc 840

tccctacacc accacccgct atggaagccc catcaacacg ttcaccgtga ggcctgggac 900 

ccgccatccc atctcctatg tctgctcagg ggcccatcgg aaagccacct cctgagccag 960 

gggttctctt agctctactc caacaaactg tgtctttaag gatctgggca cttgtttctt 1020 

gtggggggtg ggaggtacac agcaggtgtt gggcgacctg actcagacac agaggagtct 1080 

ttgatcccca aagaccaacg atgagaagca gtggcccagg aagaccgaga gtgggcttct 1140 

gtttcccaaa gcgctggtgt gactgtggac tgtttacacg tggggggtgg gaggggaagg 1200 

agggtagaga gaaggaacca ttaaggacct gggcgctgag tttctgctct ctggtactat 1260 

ctgctttagc aagagttatt taaaacagtt ttacaagcac ggcctactga gcaggttctg 1320 

aaagcaaggg gtggggcact gactgaactg gaacccatcg agatagctgt gattgcctgt 1380

caggcccttg caaagggctg aggcccagct ctgtctggtg ggtgtgcagg gcaccgaagg 1440 

gagatcttta gcctccatca cctgccatcc caagcaagag tggacacctg gggttcctag 1500 

ccgcattgag gtgggaccag gaatgtgtat gtataacatg tggagaagga ccatgtgtgg 1560 

ggtggggagt gaagctccca catgtggaat ggaagccctt tgcattggcc aagggcagag 1620 

gtgggagatt gatggttcat ttttccttgg ccctagccct tagctacagg gaggcctaga 1680 

gagaggacac ttggcaagtt taggatggga aacaaggtta agcctggctt tgctattcag 1740 

acctgtgcag ccctgatttt ggggactggg cccaaggagc catgagtctt aggcttgaca 1800 

ctaggggcct tacatggcca ggtctcatat gggatttgga gaaggcattt ttcctaacac 1860 

tgatactaca agtcaccaat actcttgatg ccttaagatg ctagaaagcc ttggagagtg 1920 

gaggcagcag agatgggaat ggaggtctga agaagcttgc tgctgttggg gaatgtgttt 1980 

gggaaagtca agagtggtag gattggttgg ctgggcgcgg tggctcacac ctgtaatccc 2040

agcactttgg gaggccgagg ccggtggatc acttgaggtc agaagttcga gaccagcctg 2100 

cccaaatggt gaaaccccat ctctactaaa aatacggaaa aaaaaaaaaa aagagtgcta 2160 

ggattgggcc agttaagagc aggggagacc tggctgggca tggtggctca cacctgtaat 2220 

tctagcactt tgggaggctg aggcgggtgg atcatgaggt caagagatta agaccatcct 2280 

ggcctacatg gcgaaaccct gtctctacta aa 2312 

<210> 22 

<211> 8227 

<212> DNA 

<213> Homo sapien 

<400> 22 

cgcccccgcc tctgagctcc cttcccatgg cggccctagt gttggaggac gggtcggtcc 60
tgcggggcca gccctttggg gccgccgtgt cgactgccgg ggaagtggtg tttcaaaccg 120

gcatggtcgg ctaccccgag gccctcactg atccctccta caaggcacag atcttagtgc 180 

tcacctatcc tctgatcggc aactatggca tccccccaga tgaaatggat gagttcggtc 240

tctgcaagtg gtttgaatcc tcgggcatcc acgtagcagc actggtagtg ggagagtgct 300 

gtcctactcc cagccactgg agtgccaccc gcaccctgca tgagtggctg cagcagcatg 360 

gcatccctgg cttgcaagga gtagacactc gggagctgac caagaagttg cgggaacagg 420 

ggtctctgct ggggaagctg gtccagaatg gaacagaacc ttcatccctg ccattcttgg 480 

accccaatgc ccgccccctg gtaccagagg tctccattaa gactccacgg gtattcaata 540 

cagggggtgc ccctcggatc cttgctttgg actgtggcct caagtataat cagatccgat 600 

gcctctgcca gcgtggggct gaggtcactg tggtaccctg ggaccatgca ctagacagcc 660 

aagagtatga gggtctcttc ttaagtaatg ggcctggtga ccctgcctcc tatcccagtg 720 

tcgtatccac actgagccgt gttttatctg agcctaatcc ccgacctgtc tttgggatct 780 

gcctgggaca ccagctattg gccttagcca ttggggccaa gacttacaag atgagatatg 840 

ggaaccgagg ccataaccag ccctgcttgt tggtgggctc tgggcgctgc tttctgacat 900 

cccagaacca tgggtttgct gtggagacag actcactgcc agcagactgg gctcctctct 960 

tcaccaacgc caatgatggt tccaatgaag gcattgtgca caacagcttg cctttcttca 1020 

gtgtccagtt tcacccagag caccaagctg gcccttcaga tatggaactg cttttcgata 1080 

tctttctgga aactgtgaaa gaggccacag ctgggaaccc tgggggccag acagttagag 1140 

agcggctgac tgagcgcctc tgtccccctg ggattcccac tcccggctct ggacttccac 1200 

caccacgaaa ggttctgatc ctgggctcag ggggcctctc cattggccaa gctggagaat 1260 

ttgactactc gggctctcag gcaattaagg ccctgaagga ggaaaacatc cagacgttgc 1320 

tgatcaaccc caatattgcc acagtgcaga cctcccaggg gctggccgac aaggtctatt 1380

ttcttcccat aacacctcat tatgtaaccc aggtgatacg taatgaacgc cccgatggtg 1440 

tgttactgac ttttgggggc cagactgctc tgaactgtgg tgtggagctg accaaggccg 1500 

gggtgctggc tcggtatggg gtccgggtcc tgggcacaac agtggagacc attgagctga 1560 

ccgaggatcg acgggccttt gctgccagaa tggcagagat cggagagcat gtggccccga 1620 

gcgaggcagg aaattctctt gaacaggccc aggcagccgc tgaacggctg gggtaccctg 1680 

tgctagtgcg tgcagccttt gccgtgggtg gcctgggctc tggctttgcc tctaacaggg 1740 

aggagctctc tgctctcgtg gccccagctt ttgcccatac cagccaagtg ctagtagaca 1800 

agtctctgaa gggatggaag gagattgagt acgaggtggt gagagacgcc tatggcaact 1860 
gtgtcacggt gtgtaacatg gagaacttgg acccactggg catccacact ggtgagtcca 1920

tagtggtggc ccctagccag acactgaatg acagggagta tcagctcctg aggcagacag 1980

ctatcaaggt gacccagcac ctgggaattg ttggggagtg caatgtgcag tatgccttga 2040

accctgagtc tgagcagtat tacatcattg aagtgaatgc caggctctct cgcagctctg 2100

ccctggccag taaggccaca ggttatccac tggcttatgt ggcagccaag ctagcattgg 2160

gcatcccttt gcctgagctc aggaactctg tgacaggggg tacagcagcc tttgaaccca 2220

gcgtggatta ttgtgtggtg aagattcctc gatgggacct tagcaagttc ctgcgagtca 2280

gcacaaagat tgggagctgc atgaagagcg ttggtgaagt catgggcatt gggcgttcat 2340

ttgaggaggc cttccagaag gccctgcgca tggtggatga gaactgtgtg ggctttgatc 2400

acacagtgaa accagtcagc gatatggagt tggagactcc aacagataag cggatttttg 2460

tggtggcagc tgctttgtgg gctggttatt cagtggaccg cctgtatgag ctcacacgca 2520

tcgaccgctg gttcctgcac cgaatgaagc gtatcatcgc acatgcccag ctgctagaac 2580

aacaccgtgg acagcctttg ccgccagacc tgctgcaaca ggccaagtgt cttggcttct 2640

cagacaaaca gattgccctt gcagttctga gcacagagct ggctgttcgc aagctgcgtc 2700

aggaactggg gatctgtcca gcagtgaaac agattgacac agttgcagct gagtggccag 2760

cccagacaaa ttacctatac ctaacgtatt ggggcaccac ccatgacctc acctttcgaa 2820

cacctcatgt cctagtcctt ggctctggcg tctaccgtat tggctctagc gttgaatttg 2880

actggtgtgc tgtaggctgc atccagcagc tccgaaagat gggatataag accatcatgg 2940

tgaactataa cccagagaca gtcagcaccg actatgacat gtgtgatcga ctctactttg 3000

atgagatctc ttttgaggtg gtgatggaca tctatgagct cgagaaccct gaaggtgtga 3060

tcctatccat gggtggacag ctgcccaaca acatggccat ggcgttgcat cggcagcagt 3120

gccgggtgct gggcacctcc cctgaagcca ttgactcggc tgagaaccgt ttcaagtttt 3180

cccggctcct tgacaccatt ggtatcagcc agcctcagtg gagggagctc agtgacctcg 3240

agtctgctcg ccaattctgc cagaccgtgg ggtacccctg tgtggtgcgc ccctcctatg 3300

tgctgagcgg tgctgctatg aatgtggcct acacggatgg agacctggag cgcttcctga 3360

gcagcgcagc agccgtctcc aaagagcatc ccgtggtcat ctccaagttc atccaggagg 3420

ctaaggagat tgacgtggat gccgtggcct ctgatggtgt ggtggcagcc atcgccatct 3480

ctgagcatgt ggagaatgca ggtgtgcatt caggtgatgc gacgctggtg acccccccac 3540

aagatatcac tgccaaaacc ctggagcgga tcaaagccat tgtgcatgct gtgggccagg 3600

agctacaggt cacaggaccc ttcaatctgc agctcattgc caaggatgac cagctgaaag 3660

ttattgaatg caacgtacgt gtctctcgct ccttcccctt cgtttccaag acactgggtg 3720

tggacctagt agccttggcc acgcgggtca tcatggggga agaagtggaa cctgtggggc 3780

taatgactgg ttctggagtc gtgggagtaa aggtgcctca gttctccttc tcccgcttgg 3840 

cgggtgctga cgtggtgttg ggtgtggaaa tgaccagtac tggggaggtg gccggctttg 3900

gggagagccg ctgtgaggca tacctcaagg ccatgctaag cactggcttt aagatcccca 3960

agaagaatat cctgctgacc attggcagct ataagaacaa aagcgagctg ctcccaactg 4020

tgcggctact ggagagcctg ggctacagcc tctatgccag tctcggcaca gctgacttct 4080

acactgagca tggcgtcaag gtaacagctg tggactggca ctttgaggag gctgtggatg 4140 

gtgagtgccc accacagcgg agcatcctgg agcagctagc tgagaaaaac tttgagctgg 4200 

tgattaacct gtcaatgcgt ggagctgggg gccggcgtct ctcttccttt gtcaccaagg 4260 

gctaccgcac ccgacgcttg gccgctgact tctccgtgcc cctaatcatc gatatcaagt 4320 

gcaccaaact ctttgtggag gccctaggcc agatcgggcc agcccctcct ttgaaggtgc 4380 

atgttgactg tatgacctcc caaaagcttg tgcgactgcc gggattgatt gatgtccatg 4440 

tgcacctgcg ggaaccaggt gggacacata aggaggactt tgcttcaggc acagccgctg 4500 

ccctggctgg gggtatcacc atggtgtgtg ccatgcctaa tacccggccc cccatcattg 4560 

acgcccctgc tctggccctg gcccagaagc tggcagaggc tggcgcccgg tgcgactttg 4620 

cgctattcct tggggcctcg tctgaaaatg caggaacctt gggcaccgtg gccgggtctg 4680 

cagccgggct gaagctttac ctcaatgaga ccttctctga gctgcggctg gacagcgtgg 4740 

tccagtggat ggagcatttc gagacatggc cctcccacct ccccattgtg gctcacgcag 4800 

agcagcaaac cgtggctgct gtcctcatgg tggctcagct cactcagcgc tcagtgcaca 4860 

tatgtcacgt ggcacggaag gaggaggtaa gagtacacct gagatcctgc tgtccctgtt 4920 

gctttcccag taacaccaaa ggtcagggta gtccttaggg gcaggagaca gcaggaggag 4980 

agtctggaga cagcaggagg agggtctcga gccagcaccc ttgcaggtct acatcatcat 5040

ttttttgagc agggatgttg gccattggga ctttgtgtag ggcaaggcat atgggtggtg 5100 

cctcttctgg atcttcccat tgttccccag atcctgctaa ttaaagctgc aaaggcacgg 5160 

ggcttgccag tgacctgcga ggtggctccc caccacctgt tcctaagcca tgatgacctg 5220 

gagcgcctgg ggcctgggaa gggggaggtc cggcctgagc ttggctcccg ccaggatgtg 5280 

gaagccctgt gggagaacat ggctgtcatc gactgctttg cctcagacca tgctccccat 5340 

accttggagg agaagtgtgg gtccaggccc ccacctgggt tcccagggtt agagaccatg 5400

ctgccactac tcctgacggc tgtaagcgag ggccggctca gcctggacga cctgctgcag 5460 

cgattgcacc acaatcctcg gcgcatcttt cacctgcccc cgcaggagga cacctatgtg 5520 
gaggtggatc tggagcatga gtggacaatt cccagccaca tgcccttctc caaggcccac 5580

tggacacctt ttgaagggca gaaagtgaag ggcaccgtcc gccgtgtggt cctgcgaggg 5640 

gaggttgcct atatcgatgg gcaggttctg gtacccccgg gctatggaca ggatgtacgg 5700 

aagtggccac agggggctgt tcctcagctc ccaccctcag cccctgccac tagtgagatg 5760 

accacgacac ctgaaagacc ccgccgtggc atcccagggc ttcctgatgg ccgcttccat 5820 

ctgccgcccc gaatccatcg agcctccgac ccaggtttgc cagctgagga gccaaaggag 5880 

aagtcctctc ggaaggtagc cgagccagag ctgatgggaa cccctgatgg cacctgctac 5940 

cctccaccac cagtaccgag acaggcatct ccccagaacc tggggacccc tggcttgctg 6000 

cacccccaga cctcacccct gctgcactca ttagtgggcc aacatatcct gtccgtccag 6060 

cagttcacca aggatcagat gtctcacctg ttcaatgtgg cacacacact gcgtatgatg 6120 

gtgcagaagg agcggagcct cgacatcctg aaggggaagg tcatggcctc catgttctat 6180 

gaagtgagca cacggaccag cagctccttt gcagcagcca tggcccggct gggaggtgct 6240

gtgctcagct tctcggaagc cacatcgtcc gtccagaagg gcgaatccct ggctgactcc 6300 

gtgcagacca tgagctgcta tgccgacgtc gtcgtgctcc ggcaccccca gcctggagca 6360 

gtggagctgg ccgccaagca ctgccggagg ccagtgatca atgctgggga tggggtcgga 6420 

gagcacccca cccaggccct gctggacatc ttcaccatcc gtgaggagct gggaactgtc 6480 

aatggcatga cgatcacgat ggtgggtgac ctgaagcacg gacgcacagt acattccctg 6540

gcctgcctgc tcacccagta tcgtgtcagc ctgcgctacg tggcacctcc cagcctgcgc 6600 
atgccaccca ctgtgcgggc cttcgtggcc tcccgcggca ccaagcagga ggaattcgag 6660

agcattgagg aggcgctgcc tgacactgat gtgctctaca tgactcgaat ccagaaggaa 6720 

cgatttggct ctacccagga gtacgaagct tgctttggtc agttcatcct cactccccac 6780 

atcatgaccc gggccaagaa gaagatggtg gtgatgcacc cgatgccccg tgtcaacgag 6840

ataagcgtgg aagtggactc ggatccccgc gcagcctact tccgccaggc tgagaacggc 6900 

atgtacatcc gcatggctct gttagccacc gtgctgggcc gtttctaggg cctggcttcc 6960 

tcagcctctt ctctttaggc ccagctgctg ggcaaggaat tccagtgcct cctacggggg 7020 

cagcacactt agatattcct ggacatccag atagctcaca tgtgctgacc acacttcagg 7080 

ctctggactg gagctctctg gcatgggggt ggggcctcag atgctggggc ccagtctgcc 7140

ccatcttcat tcctgcacct taaacctgta cagtcatttt tctactgact taataaacag 7200 

ccgagctgtc ccttgatgct gagtgtagta gaacagagct ttcttttaga aaattgagca 7260 

gattcccaaa ttataagagc agcctcacca ggcagggctc tggctagggc tgcgtgccca 7320 

cactcggcat tttcacactc gtgactcttt gggactcggc aggagcgtgt taactctgtg 7380 
ctctctgagt cctgtgtcgt gctgatgcca gcgcttacct gggggctctg gctggtgtcc 7440

agccctcctt cctcaattca gttgactcaa agacctgaca gagagttctg ccctcttcct 7500 

gacactggga gaacactcat cagggtaaaa ggaaggctga gcactgcggc ctgtctgagg 7560 

ccaggcctgg actcttcctc actctttctg gttagtcctc ctccagtggc taaggctgcc 7620 

attgcattct caaaaccacc ctggagttct tcagccactg tcactattag gagcccctcc 7680 

tatttcccat ccctgggagc ctctgcctga cattggacag aggggcagat gccatggagg 7740 

ctgaggagca ctttggcttg gagaccccag ggcagcagcc taagggcagt agcttatgtg 7800 

tctgtgggtg agggcttctg aggagcagct gtgccctcca gtgaacttga gtagcattgg 7860 

agaaatctca tttcttgttc caacaaaact tagcactctg tgggtacaaa gcaccatgtg 7920

cagtgggagt acaaggaaga acaaggcaat tctcaccctc agttttcagc ctcgggtcat 7980 

aaggcttcca gtggcttcta cagattaaac tgctgcacac tggcacatga tagtgcaaac 8040

tgctctgaca ccattcagga atcacctact tcatccagag agtaacagcg tattctagag 8100 

atgagaaaag gagaaggacc tcgttttttg agaacttagt gtgaacagga acttgatgcg 8160 

tattctcatt tctcacacca gccttccacc catttcacag atgtggacac tggctcagag 8220 
aagtggt 8227



<210> 23 

<211> 8034 

<212> DNA 

<213> Homo sapiens

<400> 23 

cgcccccgcc tctgagctcc cttcccatgg cggccctagt gttggaggac gggtcggtcc 60 

tgcggggcca gccctttggg gccgccgtgt cgactgccgg ggaagtggtg tttcaaaccg 120

gcatggtcgg ctaccccgag gccctcactg atccctccta caaggcacag atcttagtgc 180 

tcacctatcc tctgatcggc aactatggca tccccccaga tgaaatggat gagttcggtc 240

tctgcaagtg gtttgaatcc tcgggcatcc acgtagcagc actggtagtg ggagagtgct 300 

gtcctactcc cagccactgg agtgccaccc gcaccctgca tgagtggctg cagcagcatg 360 

gcatccctgg cttgcaagga gtagacactc gggagctgac caagaagttg cgggaacagg 420

ggtctctgct ggggaagctg gtccagaatg gaacagaacc ttcatccctg ccattcttgg 480

accccaatgc ccgccccctg gtaccagagg tctccattaa gactccacgg gtattcaata 540

cagggggtgc ccctcggatc cttgctttgg actgtggcct caagtataat cagatccgat 600 

gcctctgcca gcgtggggct gaggtcactg tggtaccctg ggaccatgca ctagacagcc 660 

aagagtatga gggtctcttc ttaagtaatg ggcctggtga ccctgcctcc tatcccagtg 720 
tcgtatccac actgagccgt gttttatctg agcctaatcc ccgacctgtc tttgggatct 780

gcctgggaca ccagctattg gccttagcca ttggggccaa gacttacaag atgagatatg 840 

ggaaccgagg ccataaccag ccctgcttgt tggtgggctc tgggcgctgc tttctgacat 900 

cccagaacca tgggtttgct gtggagacag actcactgcc agcagactgg gctcctctct 960 

tcaccaacgc caatgatggt tccaatgaag gcattgtgca caacagcttg cctttcttca 1020 

gtgtccagtt tcacccagag caccaagctg gcccttcaga tatggaactg cttttcgata 1080 

tctttctgga aactgtgaaa gaggccacag ctgggaaccc tgggggccag acagttagag 1140 

agcggctgac tgagcgcctc tgtccccctg ggattcccac tcccggctct ggacttccac 1200 

caccacgaaa ggttctgatc ctgggctcag ggggcctctc cattggccaa gctggagaat 1260 

ttgactactc gggctctcag gcaattaagg ccctgaagga ggaaaacatc cagacgttgc 1320 

tgatcaaccc caatattgcc acagtgcaga cctcccaggg gctggccgac aaggtctatt 1380 

ttcttcccat aacacctcat tatgtaaccc aggtgatacg taatgaacgc cccgatggtg 1440

tgttactgac ttttgggggc cagactgctc tgaactgtgg tgtggagctg accaaggccg 1500 

gggtgctggc tcggtatggg gtccgggtcc tgggcacaac agtggagacc attgagctga 1560 

ccgaggatcg acgggccttt gctgccagaa tggcagagat cggagagcat gtggccccga 1620

gcgaggcagg aaattctctt gaacaggccc aggcagccgc tgaacggctg gggtaccctg 1680 

tgctagtgcg tgcagccttt gccgtgggtg gcctgggctc tggctttgcc tctaacaggg 1740 

aggagctctc tgctctcgtg gccccagctt ttgcccatac cagccaagtg ctagtagaca 1800 

agtctctgaa gggatggaag gagattgagt acgaggtggt gagagacgcc tatggcaact 1860 

gtgtcacggt gtgtaacatg gagaacttgg acccactggg catccacact ggtgagtcca 1920 

tagtggtggc ccctagccag acactgaatg acagggagta tcagctcctg aggcagacag 1980 

ctatcaaggt gacccagcac ctgggaattg ttggggagtg caatgtgcag tatgccttga 2040 

accctgagtc tgagcagtat tacatcattg aagtgaatgc caggctctct cgcagctctg 2100 

ccctggccag taaggccaca ggttatccac tggcttatgt ggcagccaag ctagcattgg 2160 

gcatcccttt gcctgagctc aggaactctg tgacaggggg tacagcagcc tttgaaccca 2220 

gcgtggatta ttgtgtggtg aagattcctc gatgggacct tagcaagttc ctgcgagtca 2280 

gcacaaagat tgggagctgc atgaagagcg ttggtgaagt catgggcatt gggcgttcat 2340 

ttgaggaggc cttccagaag gccctgcgca tggtggatga gaactgtgtg ggctttgatc 2400 

acacagtgaa accagtcagc gatatggagt tggagactcc aacagataag cggatttttg 2460 

tggtggcagc tgctttgtgg gctggttatt cagtggaccg cctgtatgag ctcacacgca 2520

tcgaccgctg gttcctgcac cgaatgaagc gtatcatcgc acatgcccag ctgctagaac 2580

aacaccgtgg acagcctttg ccgccagacc tgctgcaaca ggccaagtgt cttggcttct 2640

cagacaaaca gattgccctt gcagttctga gcacagagct ggctgttcgc aagctgcgtc 2700

aggaactggg gatctgtcca gcagtgaaac agattgacac agttgcagct gagtggccag 2760

cccagacaaa ttacctatac ctaacgtatt ggggcaccac ccatgacctc acctttcgaa 2820

cacctcatgt cctagtcctt ggctctggcg tctaccgtat tggctctagc gttgaatttg 2880

actggtgtgc tgtaggctgc atccagcagc tccgaaagat gggatataag accatcatgg 2940

tgaactataa cccagagaca gtcagcaccg actatgacat gtgtgatcga ctctactttg 3000

atgagatctc ttttgaggtg gtgatggaca tctatgagct cgagaaccct gaaggtgtga 3060

tcctatccat gggtggacag ctgcccaaca acatggccat ggcgttgcat cggcagcagt 3120

gccgggtgct gggcacctcc cctgaagcca ttgactcggc tgagaaccgt ttcaagtttt 3180

cccggctcct tgacaccatt ggtatcagcc agcctcagtg gagggagctc agtgacctcg 3240

agtctgctcg ccaattctgc cagaccgtgg ggtacccctg tgtggtgcgc ccctcctatg 3300

tgctgagcgg tgctgctatg aatgtggcct acacggatgg agacctggag cgcttcctga 3360

gcagcgcagc agccgtctcc aaagagcatc ccgtggtcat ctccaagttc atccaggagg 3420

ctaaggagat tgacgtggat gccgtggcct ctgatggtgt ggtggcagcc atcgccatct 3480

ctgagcatgt ggagaatgca ggtgtgcatt caggtgatgc gacgctggtg acccccccac 3540

aagatatcac tgccaaaacc ctggagcgga tcaaagccat tgtgcatgct gtgggccagg 3600

agctacaggt cacaggaccc ttcaatctgc agctcattgc caaggatgac cagctgaaag 3660

ttattgaatg caacgtacgt gtctctcgct ccttcccctt cgtttccaag acactgggtg 3720

tggacctagt agccttggcc acgcgggtca tcatggggga agaagtggaa cctgtggggc 3780

taatgactgg ttctggagtc gtgggagtaa aggtgcctca gttctccttc tcccgcttgg 3840

cgggtgctga cgtggtgttg ggtgtggaaa tgaccagtac tggggaggtg gccggctttg 3900

gggagagccg ctgtgaggca tacctcaagg ccatgctaag cactggcttt aagatcccca 3960

agaagaatat cctgctgacc attggcagct ataagaacaa aagcgagctg ctcccaactg 4020

tgcggctact ggagagcctg ggctacagcc tctatgccag tctcggcaca gctgacttct 4080

acactgagca tggcgtcaag gtaacagctg tggactggca ctttgaggag gctgtggatg 4140

gtgagtgccc accacagcgg agcatcctgg agcagctagc tgagaaaaac tttgagctgg 4200

tgattaacct gtcaatgcgt ggagctgggg gccggcgtct ctcttccttt gtcaccaagg 4260

gctaccgcac ccgacgcttg gccgctgact tctccgtgcc cctaatcatc gatatcaagt 4320

gcaccaaact ctttgtggag gccctaggcc agatcgggcc agcccctcct ttgaaggtgc 4380

atgttgactg tatgacctcc caaaagcttg tgcgactgcc gggattgatt gatgtccatg 4440
tgcacctgcg ggaaccaggt gggacacata aggaggactt tgcttcaggc acagccgctg 4500 
ccctggctgg gggtatcacc atggtgtgtg ccatgcctaa tacccggccc cccatcattg 4560 

acgcccctgc tctggccctg gcccagaagc tggcagaggc tggcgcccgg tgcgactttg 4620 

cgctattcct tggggcctcg tctgaaaatg caggaacctt gggcaccgtg gccgggtctg 4680 

cagccgggct gaagctttac ctcaatgaga ccttctctga gctgcggctg gacagcgtgg 4740

tccagtggat ggagcatttc gagacatggc cctcccacct ccccattgtg gctcacgcag 4800 

agcagcaaac cgtggctgct gtcctcatgg tggctcagct cactcagcgc tcagtgcaca 4860 

tatgtcacgt ggcacggaag gaggagatcc tgctaattaa agctgcaaag gcacggggct 4920

tgccagtgac ctgcgaggtg gctccccacc acctgttcct aagccatgat gacctggagc 4980 

gcctggggcc tgggaagggg gaggtccggc ctgagcttgg ctcccgccag gatgtggaag 5040 

ccctgtggga gaacatggct gtcatcgact gctttgcctc agaccatgct ccccatacct 5100 

tggaggagaa gtgtgggtcc aggcccccac ctgggttccc agggttagag accatgctgc 5160 

cactactcct gacggctgta agcgagggcc ggctcagcct ggacgacctg ctgcagcgat 5220 

tgcaccacaa tcctcggcgc atctttcacc tgcccccgca ggaggacacc tatgtggagg 5280 

tggatctgga gcatgagtgg acaattccca gccacatgcc cttctccaag gcccactgga 5340 

caccttttga agggcagaaa gtgaagggca ccgtccgccg tgtggtcctg cgaggggagg 5400

ttgcctatat cgatgggcag gttctggtac ccccgggcta tggacaggat gtacggaagt 5460 

ggccacaggg ggctgttcct cagctcccac cctcagcccc tgccactagt gagatgacca 5520

cgacacctga aagaccccgc cgtggcatcc cagggcttcc tgatggccgc ttccatctgc 5580 

cgccccgaat ccatcgagcc tccgacccag gtttgccagc tgtgttcctc cgcccaggag 5640 

ctgggatccc acggggcagc agagcgtggg ctgaggagcc aaaggagaag tcctctcgga 5700 

aggtagccga gccagagctg atgggaaccc ctgatggcac ctgctaccct ccaccaccag 5760 

taccgagaca ggcatctccc cagaacctgg ggacccctgg cttgctgcac ccccagacct 5820 

cacccctgct gcactcatta gtgggccaac atatcctgtc cgtccagcag ttcaccaagg 5880 

atcagatgtc tcacctgttc aatgtggcac acacactgcg tatgatggtg cagaaggagc 5940 

ggagcctcga catcctgaag gggaaggtca tggcctccat gttctatgaa gtgagcacac 6000 

ggaccagcag ctcctttgca gcagccatgg cccggctggg aggtgctgtg ctcagcttct 6060 

cggaagccac atcgtccgtc cagaagggcg aatccctggc tgactccgtg cagaccatga 6120

gctgctatgc cgacgtcgtc gtgctccggc acccccagcc tggagcagtg gagctggccg 6180 
ccaagcactg ccggaggcca gtgatcaatg ctggggatgg ggtcggagag caccccaccc 6240

aggccctgct ggacatcttc accatccgtg aggagctggg aactgtcaat ggcatgacga 6300 

tcacgatggt gggtgacctg aagcacggac gcacagtaca ttccctggcc tgcctgctca 6360 

cccagtatcg tgtcagcctg cgctacgtgg cacctcccag cctgcgcatg ccacccactg 6420 

tgcgggcctt cgtggcctcc cgcggcacca agcaggagga attcgagagc attgaggagg 6480 

cgctgcctga cactgatgtg ctctacatga ctcgaatcca gaaggaacga tttggctcta 6540

cccaggagta cgaagcttgc tttggtcagt tcatcctcac tccccacatc atgacccggg 6600 

ccaagaagaa gatggtggtg atgcacccga tgccccgtgt caacgagata agcgtggaag 6660 

tggactcgga tccccgcgca gcctacttcc gccaggctga gaacggcatg tacatccgca 6720 

tggctctgtt agccaccgtg ctgggccgtt tctagggcct ggcttcctca gcctcttctc 6780 

tttaggccca gctgctgggc aaggaattcc agtgcctcct acgggggcag cacacttaga 6840

tattcctgga catccagata gctcacatgt gctgaccaca cttcaggctc tggactggag 6900 

ctctctggca tgggggtggg gcctcagatg ctggggccca gtctgcccca tcttcattcc 6960 

tgcaccttaa acctgtacag tcatttttct actgacttaa taaacagccg agctgtccct 7020 

tgatgctgag tgtagtagaa cagagctttc ttttagaaaa ttgagcagat tcccaaatta 7080 

taagagcagc ctcaccaggc agggctctgg ctagggctgc gtgcccacac tcggcatttt 7140

cacactcgtg actctttggg actcggcagg agcgtgttaa ctctgtgctc tctgagtcct 7200

gtgtcgtgct gatgccagcg cttacctggg ggctctggct ggtgtccagc cctccttcct 7260 

caattcagtt gactcaaaga cctgacagag agttctgccc tcttcctgac actgggagaa 7320

cactcatcag ggtaaaagga aggctgagca ctgcggcctg tctgaggcca ggcctggact 7380 

cttcctcact ctttctggtt agtcctcctc cagtggctaa ggctgccatt gcattctcaa 7440 

aaccaccctg gagttcttca gccactgtca ctattaggag cccctcctat ttcccatccc 7500 

tgggagcctc tgcctgacat tggacagagg ggcagatgcc atggaggctg aggagcactt 7560 

tggcttggag accccagggc agcagcctaa gggcagtagc ttatgtgtct gtgggtgagg 7620 

gcttctgagg agcagctgtg ccctccagtg aacttgagta gcattggaga aatctcattt 7680 

cttgttccaa caaaacttag cactctgtgg gtacaaagca ccatgtgcag tgggagtaca 7740 

aggaagaaca aggcaattct caccctcagt tttcagcctc gggtcataag gcttccagtg 7800 

gcttctacag attaaactgc tgcacactgg cacatgatag tgcaaactgc tctgacacca 7860 

ttcaggaatc acctacttca tccagagagt aacagcgtat tctagagatg agaaaaggag 7920 

aaggacctcg ttttttgaga acttagtgtg aacaggaact tgatgcgtat tctcatttct 7980

cacaccagcc ttccacccat ttcacagatg tggacactgg ctcagagaag tggt 8034 
<210> 24
<211> 3036 
<212> DNA 
<213> Homo sapiens

<400> 24 

aactctggcc aacgttatcc ctccagacac ctgaaagacc ccgccgtggc atcccagggc 60 

ttcctgatgg ccgcttccat ctgccgcccc gaatccatcg agcctccgac ccaggtttgc 120 

caggtaagag tggggttcct gtgactcaga gactgtgtag ggacaggatc cacttcttcc 180 

cagtgcctcg cctttctcta cttacatgtc ctcctctcca tccctttatc ctcgtctgat 240 

ctgccgtgcc atcctttgcc taaacaagtc tccccggtgt gagtagacat ctcacagctt 300 

ctcacatgcc cttttttgtt gtgggcagct gtgttcctcc gcccaggagc tgggatccca 360 

cggggcagca gagcgtgggg taaatccagg ttgttggttg gtgtgagtct ggccgttcct 420 

cttgcctagg atgcctttgc caactgggag agccccggga gggcaccact gctcccatac 480 

aacacagccc catgggaagc cacctgtctg tccccagagc tgctgcagtc ccctgccctc 540 

ccctaacctg ctatattact gtgttgtgaa ttggtttaac atatacactg tgattcagag 600 

ttcctgggaa tatggggttt ctttccaaac ctcagccata aatgtatatc tgtcctcttg 660 

tcctgtttgc agctgaggag ccaaaggaga agtcctctcg gaaggtagcc gagccagagc 720 

tgatgggaac ccctgatggc acctgctacc ctccaccacc agtaccgaga caggcatctc 780 

cccagaacct ggggacccct ggcttgctgc acccccagac ctcacccctg ctgcactcat 840 

tagtgggcca acatatcctg tccgtccagc agttcaccaa ggatcagatg tctcacctgt 900 

tcaatgtggc acacacactg cgtatgatgg tgcagaagga gcggagcctc gacatcctga 960 

aggggaaggt catggcctcc atgttctatg aagtgagcac acggaccagc agctcctttg 1020 

cagcagccat ggcccggctg ggaggtgctg tgctcagctt ctcggaagcc acatcgtccg 1080 

tccagaaggg cgaatccctg gctgactccg tgcagaccat gagctgctat gccgacgtcg 1140 

tcgtgctccg gcacccccag cctggagcag tggagctggc cgccaagcac tgccggaggc 1200

cagtgatcaa tgctggggat ggggtcggag agcaccccac ccaggccctg ctggacatct 1260 

tcaccatccg tgaggagctg ggaactgtca atggcatgac gatcacgatg gtgggtgacc 1320 

tgaagcacgg acgcacagta cattccctgg cctgcctgct cacccagtat cgtgtcagcc 1380 

tgcgctacgt ggcacctccc agcctgcgca tgccacccac tgtgcgggcc ttcgtggcct 1440

cccgcggcac caagcaggag gaattcgaga gcattgagga ggcgctgcct gacactgatg 1500 

tgctctacat gactcgaatc cagaaggaac gatttggctc tacccaggag tacgaagctt 1560 

gctttggtca gttcatcctc actccccaca tcatgacccg ggccaagaag aagatggtgg 1620 






tgatgcaccc gatgccccgt gtcaacgaga taagcgtgga agtggactcg gatccccgcg 1680 

cagcctactt ccgccaggct gagaacggca tgtacatccg catggctctg ttagccaccg 1740

tgctgggccg tttctagggc ctggcttcct cagcctcttc tctttaggcc cagctgctgg 1800 

gcaaggaatt ccagtgcctc ctacgggggc agcacactta gatattcctg gacatccaga 1860 

tagctcacat gtgctgacca cacttcaggc tctggactgg agctctctgg catgggggtg 1920 

gggcctcaga tgctggggcc cagtctgccc catcttcatt cctgcacctt aaacctgtac 1980 

agtcattttt ctactgactt aataaacagc cgagctgtcc cttgatgctg agtgtagtag 2040 

aacagagctt tcttttagaa aattgagcag attcccaaat tataagagca gcctcaccag 2100 

gcagggctct ggctagggct gcgtgcccac actcggcatt ttcacactcg tgactctttg 2160 

ggactcggca ggagcgtgtt aactctgtgc tctctgagtc ctgtgtcgtg ctgatgccag 2220 

cgcttacctg ggggctctgg ctggtgtcca gccctccttc ctcaattcag ttgactcaaa 2280 

gacctgacag agagttctgc cctcttcctg acactgggag aacactcatc agggtaaaag 2340

gaaggctgag cactgcggcc tgtctgaggc caggcctgga ctcttcctca ctctttctgg 2400 

ttagtcctcc tccagtggct aaggctgcca ttgcattctc aaaaccaccc tggagttctt 2460 

cagccactgt cactattagg agcccctcct atttcccatc cctgggagcc tctgcctgac 2520 

attggacaga ggggcagatg ccatggaggc tgaggagcac tttggcttgg agaccccagg 2580 

gcagcagcct aagggcagta gcttatgtgt ctgtgggtga gggcttctga ggagcagctg 2640 

tgccctccag tgaacttgag tagcattgga gaaatctcat ttcttgttcc aacaaaactt 2700 

agcactctgt gggtacaaag caccatgtgc agtgggagta caaggaagaa caaggcaatt 2760 

ctcaccctca gttttcagcc tcgggtcata aggcttccag tggcttctac agattaaact 2820 

gctgcacact ggcacatgat agtgcaaact gctctgacac cattcaggaa tcacctactt 2880 

catccagaga gtaacagcgt attctagaga tgagaaaagg agaaggacct cgttttttga 2940 

gaacttagtg tgaacaggaa cttgatgcgt attctcattt ctcacaccag ccttccaccc 3000

atttcacaga tgtggacact ggctcagaga agtggt 3036 

<210> 25 

<211> 3637 

<212> DNA 

<213> Homo sapiens

<400> 25 

agctgttccc cgcgctgcgc ggagcaggga ccaggcggtt gcggcggcga cagccatggc 60 

cggcgcgctg gcaggtctgg ccgcgggctt gcaggtcccg cgggtcgcgc ccagcccaga 120 

ctcggactcg gacacagact cggaggaccc gagtctccgg cgcagcgcgg gcggcttgct 180 

ccgctcgcag gtcatccaca gcggtcactt catggtgtcg tcgccgcaca gcgactcgct 240

gccccggcgg cgcgaccagg aggggtccgt ggggccctcc gacttcgggc cgcgcagtat 300

cgaccccaca ctcacacgcc tcttcgagtg cttgagcctg gcctacagtg gcaagctggt 360

gtctcccaag tggaagaatt tcaaaggcct caagctgctc tgcagagaca agatccgcct 420

gaacaacgcc atctggaggg cctggtatat ccagtatgtg aagcggagga agagccccgt 480

gtgtggcttc gtgacccccc tgcaggggcc tgaggctgat gcgcaccgga agccggaggc 540

cgtggtcctg gaggggaact actggaagcg gcgcatcgag gtggtgatgc gggaatacca 600

caagtggcgc atctactaca agaagcgggt cagtggggga gggccaggga ggccccagag 660

ctttcctcct gcggctgccg gctaccgccc gcctcggaag atccctggaa aggggatcct 720

gacccccgag cttgcgcccc tcgggccctc cattcagtcc cgggccgaca gcgccaccgt 780

gtggccacag cgtctcctag cggcctcctt acctaggggt cggctccgta agcccagcag 840

ggaagatgac ctcctggccc ctaagcaggc ggaaggcagg tggccgccgc cggagcaatg 900

gtgcaaacag ctcttctcca gtgtggtccc cgtgctgctg ggggacccag aggaggagcc 960

gggtgggcgg cagctcctgg acctcaattg ctttttgtcc gacatctcgg acactctctt 1020

caccatgact cagtccggcc cttcgcccct gcagctgccg cctgaggatg cctacgtcgg 1080

caatgctgac atgatccagc cggacctgac gccactgcag ccaagcctgg atgacttcat 1140

ggacatctca gatttcttta ccaactcccg cctcccacag ccgcccatgc cttcaaactt 1200

cccagagccc cccagcttca gccccgtggt tgactccctc ttcagcagtg ggaccctggg 1260

cccagaggtg cccccggctt cctcggccat gacccacctc tctggacaca gccgtctgca 1320

ggctcggaac agctgccctg gccccttgga ctccagcgcc ttcctgagtt ctgatttcct 1380

ccttcctgaa gaccccaagc cccggctccc accccctcct gtacccccac ctctgctgca 1440

ttaccctccc cctgccaagg tgccaggcct ggagccctgc cccccacctc ccttccctcc 1500

catggcacca cccactgctt tgctgcagga agagcctctc ttctctccca ggtttccctt 1560

ccccaccgtc cctcctgccc caggagtgtc tccgctgcct gctcctgcag ccttcccacc 1620

caccccacag tctgtcccca gcccagcccc cacccccttc cccatagagc ttctaccctt 1680

ggggtattcg gagcctgcct ttgggccttg cttctccatg cccagaggca agccccccgc 1740

cccatcccct aggggacaga aagccagccc ccctacctta gcccctgcca ctgccagtcc 1800

ccccaccact gcggggagca acaacccctg cctcacacag ctgctcacag cagctaagcc 1860

ggagcaagcc ctggagccac cacttgtatc cagcaccctc ctccggtccc cagggtcccc 1920

gcaggagaca gtccctgaat tcccctgcac attccttccc ccgaccccgg cccctacacc 1980


1980 
gccccggcca cctccaggcc cggccacatt ggccccttcc aggcccctgc ttgtccccaa 2040

agcggagcgg ctctcacccc cagcgcccag cggtaaagag aggggttgca gggattgggg 2100

gaatctaggg gatgagggga agaggaggaa gaagaagggg actgggtctg gggccagatg 2160

agaggaaggg ggcaatgggg accctttccc ctgggtcggt ctttctgtgg gtctgtctgt 2220

ccctaccaca ggcagtgaac ggcggctgtc aggggacctc agctccatgc caggccctgg 2280

gactctgagc gtccgtgtct ctcccccgca acccatcctc agccggggcc gtccagacag 2340

caacaagacc gagaaccggc gtatcacaca catctccgcg gagcagaagc ggcgcttcaa 2400

catcaagctg gggtttgaca cccttcatgg gctcgtgagc acactcagtg cccagcccag 2460

cctcaaggtg agcaaagcta ccacgctgca gaagacagct gagtacatcc ttatgctaca 2520

gcaggagcgt gcgggcttgc aggaggaggc ccagcagctg cgggatgaga ttgaggagct 2580

caatgccgcc attaacctgt gccagcagca gctgcccgcc acaggggtac ccatcacaca 2640

ccagcgtttt gaccagatgc gagacatgtt tgatgactac gtccgaaccc gtacgctgca 2700

caactggaag ttctgggtgt tcagcatcct catccggcct ctgtttgagt ccttcaacgg 2760

gatggtgtcc acggcaagtg tgcacaccct ccgccagacc tcactggcct ggctggacca 2820

gtactgctct ctgcccgctc tccggccaac tgtcctgaac tccctacgcc agctgggcac 2880

atctaccagt atcctgaccg acccgggccg catccctgag caagccacac gggcagtcac 2940

agagggcacc cttggcaaac ctttatagtc ctggccagac cctgctgctc actcagctgc 3000

cctgggggct gctttccctg ggcacgggct ccagggatca tctctgggca ctcccttcct 3060

gccccaggcc ctggctctgc ccttccctgg ggggtggagc agggtccagg tttcacactt 3120

gccacctcct ggaggtcaag aagagcagag tccccgtccc tgctctgcca ctgtgctcca 3180

gcaccgtgac cttgggtgac tcgtccgctg tctttggacc gctgtgtttc aatctgcaaa 3240

atggggatgg ggaaggttca atcagcagat gacccccagg ccttggcagc tgtgacattg 3300

ggggcctagg ctggcaactc cgggggctca acggtggaaa gaggaggatg ctgtttctct 3360

gtcacctcca cttgctcccc gacaggtggg gcacagacct ctgttcctga gcagagaagc 3420

agaaaaggag gttccctctc tctgctcctt cactgctgac ccagaggggc tgcaggatgg 3480

tttcccctgg gagaggccag gagggcctga tcccaggaga caccagggcc agagtgacca 3540

cagcagggca ggcatcatgt gtgtgtgtgt gtgtggatgt gtgtgtgtgg gttttgtaaa 3600

gaattcttga ccaataaaag caaaaactgt ctgctgg 3637



<210> 26 

<211> 2763 

<212> DNA 

<213> Homo sapiens
<400> 26

ctcatcgaag cccctgggga cagaaagagg gtccctggga taaagcttag tcttagtgac 60 

ctcctaattc cttccccacg gggtcctggg cctgtcaatc aagaggtgga ggcctatggg 120 

gcgggaacta gggaatgagg aaccaccccc acccctgttc cagtcccggg agggggtggg 180 

ggctgcccag gggcagaagt gtgcatggca gtgccctggg agcaggccag cctgggaggg 240

cagcggccct gtctgggcac acagcgcccg gccacagatt tctttaccaa ctcccgcctc 300

ccacagccgc ccatgccttc aaacttccca gagcccccca gcttcagccc cgtggttgac 360 

tccctcttca gcagtgggac cctgggccca gaggtgcccc cggcttcctc ggccatgacc 420 

cacctctctg gacacagccg tctgcaggct cggaacagct gccctggccc cttggactcc 480 

agcgccttcc tgagttctga tttcctcctt cctgaagacc ccaagccccg gctcccaccc 540 

cctcctgtac ccccacctct gctgcattac cctccccctg ccaaggtgcc aggcctggag 600 

ccctgccccc cacctccctt ccctcccatg gcaccaccca ctgctttgct gcaggaagag 660 

cctctcttct ctcccaggtt tcccttcccc accgtccctc ctgccccagg agtgtctccg 720 

ctgcctgctc ctgcagcctt cccacccacc ccacagtctg tccccagccc agcccccacc 780 

cccttcccca tagagcttct acccttgggg tattcggagc ctgcctttgg gccttgcttc 840 

tccatgccca gaggcaagcc ccccgcccca tcccctaggg gacagaaagc cagcccccct 900 

accttagccc ctgccactgc cagtcccccc accactgcgg ggagcaacaa cccctgcctc 960 

acacagctgc tcacagcagc taagccggag caagccctgg agccaccact tgtatccagc 1020 

accctcctcc ggtccccagg gtccccgcag gagacagtcc ctgaattccc ctgcacattc 1080 

cttcccccga ccccggcccc tacaccgccc cggccacctc caggcccggc cacattggcc 1140

ccttccaggc ccctgcttgt ccccaaagcg gagcggctct cacccccagc gcccagcggt 1200 

aaagagaggg gttgcaggga ttgggggaat ctaggggatg aggggaagag gaggaagaag 1260 

aaggggactg ggtctggggc cagatgagag gaagggggca atggggaccc tttcccctgg 1320 

gtcggtcttt ctgtgggtct gtctgtccct accacaggca gtgaacggcg gctgtcaggg 1380 

gacctcagct ccatgccagg ccctgggact ctgagcgtcc gtgtctctcc cccgcaaccc 1440 

atcctcagcc ggggccgtcc agacagcaac aagaccgaga accggcgtat cacacacatc 1500 

tccgcggagc agaagcggcg cttcaacatc aagctggggt ttgacaccct tcatgggctc 1560 

gtgagcacac tcagtgccca gcccagcctc aaggtgagca aagctaccac gctgcagaag 1620 

acagctgagt acatccttat gctacagcag gagcgtgcgg gcttgcagga ggaggcccag 1680 

cagctgcggg atgagattga ggagctcaat gccgccatta acctgtgcca gcagcagctg 1740 

cccgccacag gggtacccat cacacaccag cgttttgacc agatgcgaga catgtttgat 1800 
gactacgtcc gaacccgtac gctgcacaac tggaagttct gggtgttcag catcctcatc 1860

cggcctctgt ttgagtcctt caacgggatg gtgtccacgg caagtgtgca caccctccgc 1920 

cagacctcac tggcctggct ggaccagtac tgctctctgc ccgctctccg gccaactgtc 1980 

ctgaactccc tacgccagct gggcacatct accagtatcc tgaccgaccc gggccgcatc 2040 

cctgagcaag ccacacgggc agtcacagag ggcacccttg gcaaaccttt atagtcctgg 2100 

ccagaccctg ctgctcactc agctgccctg ggggctgctt tccctgggca cgggctccag 2160 

ggatcatctc tgggcactcc cttcctgccc caggccctgg ctctgccctt ccctgggggg 2220 

tggagcaggg tccaggtttc acacttgcca cctcctggag gtcaagaaga gcagagtccc 2280 

cgtccctgct ctgccactgt gctccagcac cgtgaccttg ggtgactcgt ccgctgtctt 2340

tggaccgctg tgtttcaatc tgcaaaatgg ggatggggaa ggttcaatca gcagatgacc 2400 

cccaggcctt ggcagctgtg acattggggg cctaggctgg caactccggg ggctcaacgg 2460 

tggaaagagg aggatgctgt ttctctgtca cctccacttg ctccccgaca ggtggggcac 2520 

agacctctgt tcctgagcag agaagcagaa aaggaggttc cctctctctg ctccttcact 2580 

gctgacccag aggggctgca ggatggtttc ccctgggaga ggccaggagg gcctgatccc 2640

aggagacacc agggccagag tgaccacagc agggcaggca tcatgtgtgt gtgtgtgtgt 2700 

ggatgtgtgt gtgtgggttt tgtaaagaat tcttgaccaa taaaagcaaa aactgtctgc 2760 

tgg 2763

<210> 27 
<211> 3837 
<212> DNA 
<213> Homo sapiens

<400> 27 

agctgttccc cgcgctgcgc ggagcaggga ccaggcggtt gcggcggcga cagccatggc 60 

cggcgcgctg gcaggtctgg ccgcgggctt gcaggtcccg cgggtcgcgc ccagcccaga 120 

ctcggactcg gacacagact cggaggaccc gagtctccgg cgcagcgcgg gcggcttgct 180 

ccgctcgcag gtcatccaca gcggtcactt catggtgtcg tcgccgcaca gcgactcgct 240 

gccccggcgg cgcgaccagg aggggtccgt ggggccctcc gacttcgggc cgcgcagtat 300 

cgaccccaca ctcacacgcc tcttcgagtg cttgagcctg gcctacagtg gcaagctggt 360 

gtctcccaag tggaagaatt tcaaaggcct caagctgctc tgcagagaca agatccgcct 420 

gaacaacgcc atctggaggg cctggtatat ccagtatgtg aagcggagga agagccccgt 480 

gtgtggcttc gtgacccccc tgcaggggcc tgaggctgat gcgcaccgga agccggaggc 540

cgtggtcctg gaggggaact actggaagcg gcgcatcgag gtggtgatgc gggaatacca 600 
caagtggcgc atctactaca agaagcgggt cagtggggga gggccaggga ggccccagag 660

ctttcctcct gcggctgccg gctaccgccc gcctcggaag atccctggaa aggggatcct 720 

gacccccgag cttgcgcccc tcgggccctc cattcagtcc cgggccgaca gcgccaccgt 780 

gtggccacag cgtctcctag cggcctcctt acctaggggt cgggtgagct cctgatggga 840 

aatgggggat ctcatcgctt gtgagtagag gagactttgg ggggaaagtg atggaggatg 900 

gggcaaggga tccggtgtcc aactctgtgt gtccctgcag ctccgtaagc ccagcaggga 960 

agatgacctc ctggccccta agcaggcgga aggcaggtgg ccgccgccgg agcaatggtg 1020 

caaacagctc ttctccagtg tggtccccgt gctgctgggg gacccagagg aggagccggg 1080 

tgggcggcag ctcctggacc tcaattgctt tttgtccgac atctcggaca ctctcttcac 1140 

catgactcag tccggccctt cgcccctgca gctgccgcct gaggatgcct acgtcggcaa 1200 

tgctgacatg atccagccgg acctgacgcc actgcagcca agcctggatg acttcatgga 1260 

catctcagat ttctttacca actcccgcct cccacagccg cccatgcctt caaacttccc 1320 

agagcccccc agcttcagcc ccgtggttga ctccctcttc agcagtggga ccctgggccc 1380

agaggtgccc ccggcttcct cggccatgac ccacctctct ggacacagcc gtctgcaggc 1440 

tcggaacagc tgccctggcc ccttggactc cagcgccttc ctgagttctg atttcctcct 1500 

tcctgaagac cccaagcccc ggctcccacc ccctcctgta cccccacctc tgctgcatta 1560 

ccctccccct gccaaggtgc caggcctgga gccctgcccc ccacctccct tccctcccat 1620 

ggcaccaccc actgctttgc tgcaggaaga gcctctcttc tctcccaggt ttcccttccc 1680 

caccgtccct cctgccccag gagtgtctcc gctgcctgct cctgcagcct tcccacccac 1740 

cccacagtct gtccccagcc cagcccccac ccccttcccc atagagcttc tacccttggg 1800 

gtattcggag cctgcctttg ggccttgctt ctccatgccc agaggcaagc cccccgcccc 1860 

atcccctagg ggacagaaag ccagcccccc taccttagcc cctgccactg ccagtccccc 1920 

caccactgcg gggagcaaca acccctgcct cacacagctg ctcacagcag ctaagccgga 1980 

gcaagccctg gagccaccac ttgtatccag caccctcctc cggtccccag ggtccccgca 2040 

ggagacagtc cctgaattcc cctgcacatt ccttcccccg accccggccc ctacaccgcc 2100 

ccggccacct ccaggcccgg ccacattggc cccttccagg cccctgcttg tccccaaagc 2160 

ggagcggctc tcacccccag cgcccagcgg taaagagagg ggttgcaggg attgggggaa 2220 

tctaggggat gaggggaaga ggaggaagaa gaaggggact gggtctgggg ccagatgaga 2280 

ggaagggggc aatggggacc ctttcccctg ggtcggtctt tctgtgggtc tgtctgtccc 2340 

taccacaggc agtgaacggc ggctgtcagg ggacctcagc tccatgccag gccctgggac 2400
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tctgagcgtc cgtgtctctc ccccgcaacc catcctcagc cggggccgtc cagacagcaa 2460 

caagaccgag aaccggcgta tcacacacat ctccgcggag cagaagcggc gcttcaacat 2520 

caagctgggg tttgacaccc ttcatgggct cgtgagcaca ctcagtgccc agcccagcct 2580 

caaggtgagc aaagctacca cgctgcagaa gacagctgag tacatcctta tgctacagca 2640

ggagcgtgcg ggcttgcagg aggaggccca gcagctgcgg gatgagattg aggagctcaa 2700 

tgccgccatt aacctgtgcc agcagcagct gcccgccaca ggggtaccca tcacacacca 2760 

gcgttttgac cagatgcgag acatgtttga tgactacgtc cgaacccgta cgctgcacaa 2820 

ctggaagttc tgggtgttca gcatcctcat ccggcctctg tttgagtcct tcaacgggat 2880

ggtgtccacg gcaagtgtgc acaccctccg ccagacctca ctggcctggc tggaccagta 2940 

ctgctctctg cccgctctcc ggccaactgt cctgaactcc ctacgccagc tgggcacatc 3000 

taccagtatc ctgaccgacc cgggccgcat ccctgagcaa gccacacggg cagtcacaga 3060 

gggcaccctt ggcaaacctt tatagtcctg gccagaccct gctgctcact cagctgccct 3120 

gggggctgct ttccctgggc acgggctcca gggatcatct ctgggcactc ccttcctgcc 3180 

ccaggccctg gctctgccct tccctggggg gtggagcagg gtccaggttt cacacttgcc 3240

acctcctgga ggtcaagaag agcagagtcc ccgtccctgc tctgccactg tgctccagca 3300

ccgtgacctt gggtgactcg tccgctgtct ttggaccgct gtgtttcaat ctgcaaaatg 3360

gggatgggga aggttcaatc agcagatgac ccccaggcct tggcagctgt gacattgggg 3420 

gcctaggctg gcaactccgg gggctcaacg gtggaaagag gaggatgctg tttctctgtc 3480 

acctccactt gctccccgac aggtggggca cagacctctg ttcctgagca gagaagcaga 3540 

aaaggaggtt ccctctctct gctccttcac tgctgaccca gaggggctgc aggatggttt 3600 

cccctgggag aggccaggag ggcctgatcc caggagacac cagggccaga gtgaccacag 3660 

cagggcaggc atcatgtgtg tgtgtgtgtg tggatgtgtg tgtgtgggtt ttgtaaagaa 3720 

ttcttgacca ataaaagcaa aaactgtctg ctggaaaaaa aaaaaatact tgaaaaactg 3780 

aaagcttaat ggagcgttac taaccttaaa agaaaggagt actttgacat ttaaaaa 3837 

<210> 28 
<211> 1060 
<212> DNA 
<213> Homo sapien 

<400> 28 

gttctccctt gtcctggccc tgggcctcta gtcacttcct gcttcccacc aacagtttct 60 

ccttaccccc acccaggtcc tggatggtcg ccggcccaca ggggggcgac tggaggtaat 120 

ggtccggatt cgggagccac tgacagccca gcagttggag acgacgacag agagcaggtt 180

gctgggccca aagggaaggc ccctcctgtg cctgcccctg caagggagtc agggaacaga 240

tcagcccggc ccctgcatag cctcagtgtg ctggcgtttg accaagagcg tctggagcgg 300

aagatcctgg ccctcaggca ggcgcggcgg ccggtgcccc cagaagtggc ccagcagtac 360

caggacatca tgcaacgcag ccagtggcag agggcacagc tggagcaggg gggtgtgggc 420

atccgacggg aatacgcagc ccagctggag cggcagctgc agttctacac ggaggctgcc 480

cggcgcctgg gcaacgatgg cagcagggat gctgcaaagg aggcgctcta taggcggaat 540 

ctggtagaga gtgagctgca gcggctccgc aggtgaggag cccatggggc gggcagcccc 600 

cagaaagcgg gcagcaggcc ccgataccgg gaagagccga cacagccacg aaccagacaa 660 

gcagacaatc agcggacaat cggttctgga ctcacccctc atccgggccc ccagccccgc 720 

cagagcctcc gtggctgcgg gtgttgggaa ccatgcctgc cagccagtat gtgcccctca 780 

cccaggcctg gctgggccct ggagagtcct gtttgcacag cccaggggtg tccggcctct 840

ggcccgcccc ggagcaggga gggtggctgg ggccaagccc cgagggcccc tgcaagcact 900 

ttacttcctg ttcctcccca gccttaaccc caaagccctc ctgcacccca aagaagccac 960 

tgaggctggc cgagccacac tgtctcccca ggggcgtcga cctggcccag ctgggtcccc 1020

agggccagca catggaataa aatagccagg gccacactca 1060



<210> 29
<211> 840 
<212> DNA 
<213> Homo sapien 

<400> 29 

gggaggggct cggggatggg cacctggagg gggaggggct cggggatggg cacctggagg 60

gggaggggct cggggatggg cacctggagg gggaggggct cggggatggg cacctggcgg 120 

gggacgggct tggggatggg cacagtccag gagggttgtg ggcagtgagg ccccacccta 180 

agcctccatt cccccgccat ccattctcag aatacgcagc ccagctggag cggcagctgc 240 

agttctacac ggaggctgcc cggcgcctgg gcaacgatgg cagcagggat gctgcaaagg 300 

aggcgctcta taggcggaat ctggtagaga gtgagctgca gcggctccgc aggtgaggag 360 

cccatggggc gggcagcccc cagaaagcgg gcagcaggcc ccgataccgg gaagagccga 420 

cacagccacg aaccagacaa gcagacaatc agcggacaat cggttctgga ctcacccctc 480 

atccgggccc ccagccccgc cagagcctcc gtggctgcgg gtgttgggaa ccatgcctgc 540 

cagccagtat gtgcccctca cccaggcctg gctgggccct ggagagtcct gtttgcacag 600 

cccaggggtg tccggcctct ggcccgcccc ggagcaggga gggtggctgg ggccaagccc 660 

cgagggcccc tgcaagcact ttacttcctg ttcctcccca gccttaaccc caaagccctc 720

ctgcacccca aagaagccac tgaggctggc cgagccacac tgtctcccca ggggcgtcga 780

cctggcccag ctgggtcccc agggccagca catggaataa aatagccagg gccacactca 840

<210> 30 

<211> 927 

<212> DNA 

<213> Homo sapien 

<400> 30 

gggaggggct cggggatggg cacctggagg gggaggggct cggggatggg cacctggagg 60 

gggaggggct cggggatggg cacctggagg gggaggggct cggggatggg cacctggcgg 120 

gggacgggct tggggatggg cacagtccag gagggttgtg ggcagtgagg ccccacccta 180 

agcctccatt cccccgccat ccattctcag aatacgcagc ccagctggag cggcagctgc 240 

agttctacac ggaggctgcc cggcgcctgg gcaacgatgg cagcagggtg agctggtcgc 300

gggccgggtg ggcactgggc agcgggcagg gtggggcctg cagggactac ctgctgaatg 360 

cccatccccc acaggatgct gcaaaggagg cgctctatag gcggaatctg gtagagagtg 420 

agctgcagcg gctccgcagg tgaggagccc atggggcggg cagcccccag aaagcgggca 480

gcaggccccg ataccgggaa gagccgacac agccacgaac cagacaagca gacaatcagc 540 

ggacaatcgg ttctggactc acccctcatc cgggccccca gccccgccag agcctccgtg 600 

gctgcgggtg ttgggaacca tgcctgccag ccagtatgtg cccctcaccc aggcctggct 660 

gggccctgga gagtcctgtt tgcacagccc aggggtgtcc ggcctctggc ccgccccgga 720

gcagggaggg tggctggggc caagccccga gggcccctgc aagcacttta cttcctgttc 780 

ctccccagcc ttaaccccaa agccctcctg caccccaaag aagccactga ggctggccga 840

gccacactgt ctccccaggg gcgtcgacct ggcccagctg ggtccccagg gccagcacat 900 

ggaataaaat agccagggcc acactca 927

<210> 31 
<211> 3373 
<212> DNA 
<213> Homo sapien 

<400> 31 

cagcccctgg agcttctccg ggcagaagag gaggtgggct ggactgtcaa agaaagtggc 60 

tggcttccgt gggcaatccg aggttcctag tggccctctg cagaaagctg aactcccagt 120 

tcccccaggc ctccgtcccg cggcggactc ctaggcttcc tccctccctg tcgccgccag 180

cctctcgctc tccccgtctt ccgcccgcac tctctgggca gggctgcggc caggacccgc 240 

ccccgcgtcc cgcccgaccc tccgccaggg gtcacttccc ctgtccaggt ttcagcttcc 300

acatgtgtca agcggctggc tcagcccaga gtccctgtct cccgcccgcc ggcccgagcc 360 






gccgcccctc ccccgcctcc cgtgcgcccg ggacaatcct cgccttgtct gtggcgccgg 420 

catctggagc tttctgtagc ctccggatac gccttttttt cagggcgtag ccccagccaa 480 

gctgctcccc gcggcggccg cacagcagcc cgagcgcccc ctttccagag ctcccctccg 540 

gagctgggat ccaggcgcgt agcggagatc ccaggatcct gggtgctgtc tgggcccgct 600 

ccccaccatg acctcctcgg ggcctggacc ccggttcctg ctgctgctgc cgctgctgct 660 

gccccctgcg gcctcagcct ccgaccggcc ccggggccga gacccggtca acccagagaa 720 

gctgctggtg atcactgtgg ccacagctga aaccgagggg tacctgcgtt tcctgcgctc 780 

tgcggagttc ttcaactaca ctgtgcggac cctgggcctg ggagaggagt ggcgaggggg 840 

tgatgtggct cgaacagttg gtggaggaca gaaggtccgg tggttaaaga aggaaatgga 900 

gaaatacgct gaccgggagg atatgatcat catgtttgtg gatagctacg acgtgattct 960 

ggccggcagc cccacagagc tgctgaagaa gttcgtccag agtggcagcc gcctgctctt 1020 

ctctgcagag agcttctgct ggcccgagtg ggggctggcg gagcagtacc ctgaggtggg 1080 

cacggggaag cgcttcctca attctggtgg attcatcggt tttgccacca ccatccacca 1140 

aatcgtgcgc cagtggaagt acaaggatga tgacgacgac cagctgttct acacacggct 1200 

ctacctggac ccaggactga gggagaaact cagccttaat ctggatcata agtctcggat 1260 

ctttcagaac ctcaacgggg ctttagatga agtggtttta aagtttgatc ggaaccgtgt 1320 

gcgtatccgg aacgtggcct acgacacgct ccccattgtg gtccatggaa acggtcccac 1380 

taagctgcag ctcaactacc tgggaaacta cgtccccaat ggctggactc ctgagggagg 1440 

ctgtggcttc tgcaaccagg accggaggac actcccgggg gggcagcctc ccccccgggt 1500 

gtttctggcc gtgtttgtgg aacagcctac tccgtttctg ccccgcttcc tgcagcggct 1560 

gctactcctg gactatcccc ccgacagggt cacccttttc ctgcacaaca acgaggtctt 1620 

ccatgaaccc cacatcgctg actcctggcc gcagctccag gaccacttct cagctgtgaa 1680 

gctcgtgggg ccggaggagg ctctgagccc aggcgaggcc agggacatgg ccatggacct 1740

gtgtcggcag gaccccgagt gtgagttcta cttcagcctg gacgccgacg ctgtcctcac 1800 

caacctgcag accctgcgta tcctcattga ggagaacagg aaggtgatcg cccccatgct 1860 

gtcccgccac ggcaagctgt ggtccaactt ctggggcgcc ctgagccccg atgagtacta 1920 

cgcccgctcc gaggactacg tggagctggt gcagcggaag cgagtgggtg tgtggaatgt 1980 

accatacatc tcccaggcct atgtgatccg gggtgatacc ctgcggatgg agctgcccca 2040 

gagggatgtg ttctcgggca gtgacacaga cccggacatg gccttctgta agagctttcg 2100 

agacaagggc atcttcctcc atctgagcaa tcagcatgaa tttggccggc tcctggccac 2160

ttccagatac gacacggagc acctgcaccc cgacctctgg cagatcttcg acaaccccgt 2220

cgactggaag gagcagtaca tccacgagaa ctacagccgg gccctggaag gggaaggaat 2280

cgtggagcag ccatgcccgg acgtgtactg gttcccactg ctgtcagaac aaatgtgtga 2340

tgagctggtg gcagagatgg agcactacgg ccagtggtca ggcggccggc atgagactgc 2400

actgcaggaa tgcctcatgc cggtcgagct ctgccctgct gtgagcagca gccaccacac 2460

agcttctaag gctgcgatga gcagcctggg tgcaatagaa gccacgggga agccttgacc 2520

cagacttggg aggagtaggg agcttcacag gacaggtgac atcttagctg agacttgaag 2580

ggccgggagc agtgagtcag gccgagaagt agcaaagaag actgttctgc acagaggaac 2640

acaggtgtga aggctggagg acgggaggaa ggaggctgga ctgccctggg gtggggagcg 2700

ccatggctgg aggggcgggt ggctcagtgc ccgccgcccc caggattcaa ggctggctgg 2760

aggctacgag aatgtgccca ccgtggacat ccacatgaag caggtggggt acgaggacca 2820

gtggctgcag ctgctgcgga cgtatgtggg ccccatgacc gagagcctgt ttcccggtta 2880

ccacaccaag gcgcgggcgg tgatgaactt tgtggttcgc taccggccag acgagcagcc 2940

gtctctgcgg ccacaccacg actcatccac cttcaccctc aacgttgccc tcaaccacaa 3000

gggcctggac tatgagggag gtggctgccg cttcctgcgc tacgactgtg tgatctcctc 3060

cccgaggaag ggctgggcac tcctgcaccc cggccgcctc acccactacc acgaggggct 3120

gccaacgacc tggggcacac gctacatcat ggtgtccttt gtcgacccct gacactcaac 3180

cactccgccc acaggggttc gtgggcacag ggcttctggg gactccccgc gtgataaatt 3240

attaatgttc cgcagtctca ctctgaataa aggacagttt gtaagtcttg aaataataaa 3300

atagaatggc ctctgtgggc ctcacggagt ggtgcttatc tgggccgttc cttttattgt 3360

agggtaaatt aat 3373



<210> 32 

<211> 2655 

<212> DNA 

<213> Homo sapien 

<400> 32 

cagcccctgg agcttctccg ggcagaagag gaggtgggct ggactgtcaa agaaagtggc 60 

tggcttccgt gggcaatccg aggttcctag tggccctctg cagaaagctg aactcccagt 120 

tcccccaggc ctccgtcccg cggcggactc ctaggcttcc tccctccctg tcgccgccag 180 

cctctcgctc tccccgtctt ccgcccgcac tctctgggca gggctgcggc caggacccgc 240 

ccccgcgtcc cgcccgaccc tccgccaggg gtcacttccc ctgtccaggt ttcagcttcc 300

acatgtgtca agcggctggc tcagcccaga gtccctgtct cccgcccgcc ggcccgagcc 360

gccgcccctc ccccgcctcc cgtgcgcccg ggacaatcct cgccttgtct gtggcgccgg 420

catctggagc tttctgtagc ctccggatac gccttttttt cagggcgtag ccccagccaa 480 

gctgctcccc gcggcggccg cacagcagcc cgagcgcccc ctttccagag ctcccctccg 540 

gagctgggat ccaggcgcgt agcggagatc ccaggatcct gggtgctgtc tgggcccgct 600 

ccccaccatg acctcctcgg ggcctggacc ccggttcctg ctgctgctgc cgctgctgct 660 

gccccctgcg gcctcagcct ccgaccggcc ccggggccga gacccggtca acccagagaa 720 

gctgctggtg atcactgtgg ccacagctga aaccgagggg tacctgcgtt tcctgcgctc 780 

tgcggagttc ttcaactaca ctgtgcggac cctgggcctg ggagaggagt ggcgaggggg 840 

tgatgtggct cgaacagttg gtggaggaca gaaggtccgg tggttaaaga aggaaatgga 900 

gaaatacgct gaccgggagg atatgatcat catgtttgtg gatagctacg acgtgattct 960 

ggccggcagc cccacagagc tgctgaagaa gttcgtccag agtggcagcc gcctgctctt 1020 

ctctgcagag agcttctgct ggcccgagtg ggggctggcg gagcagtacc ctgaggtggg 1080 

cacggggaag cgcttcctca attctggtgg attcatcggt tttgccacca ccatccacca 1140 

aatcgtgcgc cagtggaagt acaaggatga tgacgacgac cagctgttct acacacggct 1200 

ctacctggac ccaggactga gggagaaact cagccttaat ctggatcata agtctcggat 1260 

ctttcagaac ctcaacgggg ctttagatga agtggtttta aagtttgatc ggaaccgtgt 1320 

gcgtatccgg aacgtggcct acgacacgct ccccattgtg gtccatggaa acggtcccac 1380

taagctgcag ctcaactacc tgggaaacta cgtccccaat ggctggactc ctgagggagg 1440 

ctgtggcttc tgcaaccagg accggaggac actcccgggg gggcagcctc ccccccgggt 1500 

gtttctggcc gtgtttgtgg aacagcctac tccgtttctg ccccgcttcc tgcagcggct 1560 

gctactcctg gactatcccc ccgacagggt cacccttttc ctgcacaaca acgaggtctt 1620 

ccatgaaccc cacatcgctg actcctggcc gcagctccag gaccacttct cagctgtgaa 1680 

gctcgtgggg ccggaggagg ctctgagccc aggcgaggcc agggacatgg ccatggacct 1740 

gtgtcggcag gaccccgagt gtgagttcta cttcagcctg gacgccgacg ctgtcctcac 1800 

caacctgcag accctgcgta tcctcattga ggagaacagg aaggtgatcg cccccatgct 1860 

gtcccgccac ggcaagctgt ggtccaactt ctggggcgcc ctgagccccg atgagtacta 1920 

cgcccgctcc gaggactacg tggagctggt gcagcggaag cgagtgggtg tgtggaatgt 1980 

accatacatc tcccaggcct atgtgatccg gggtgatacc ctgcggatgg agctgcccca 2040 

gagggatgtg ttctcgggca gtgacacaga cccggacatg gccttctgta agagctttcg 2100 

agacaagggc atcttcctcc atctgagcaa tcagcatgaa tttggccggc tcctggccac 2160 

ttccagatac gacacggagc acctgcaccc cgacctctgg cagatcttcg acaaccccgt 2220

cgactggaag gagcagtaca tccacgagaa ctacagccgg gccctggaag gggaaggaat 2280

cgtggagcag ccatgcccgg acgtgtactg gttcccactg ctgtcagaac aaatgtgtga 2340

tgagctggtg gcagagatgg agcactacgg ccagtggtca ggcggccggc atgaggtaag 2400

ggcctgggcg aggagggcag aggccttccc agaggaggag gtgcaggatc tggtaaacaa 2460

acaatggaat caggaaaatg ggagcaaaga ggggttccca ggggggaggc agaccctggg 2520

gtcacaactg cattcattca tgtattcaag aaatatttac tgctgggcac agtggctcat 2580

gcctgtaatc ctagcacttt gggaggctga ggcgggagga ttgctcgagc agggggagtt 2640

tgaggccagc ctggg 2655


<210> 33 

<211> 2570 

<212> DNA 

<213> Homo sapien 












<400> 33 
cagcccctgg agcttctccg ggcagaagag gaggtgggct ggactgtcaa agaaagtggc 60

tggcttccgt gggcaatccg aggttcctag tggccctctg cagaaagctg aactcccagt 120

tcccccaggc ctccgtcccg cggcggactc ctaggcttcc tccctccctg tcgccgccag 180

cctctcgctc tccccgtctt ccgcccgcac tctctgggca gggctgcggc caggacccgc 240

ccccgcgtcc cgcccgaccc tccgccaggg gtcacttccc ctgtccaggt ttcagcttcc 300

acatgtgtca agcggctggc tcagcccaga gtccctgtct cccgcccgcc ggcccgagcc 360

gccgcccctc ccccgcctcc cgtgcgcccg ggacaatcct cgccttgtct gtggcgccgg 420

catctggagc tttctgtagc ctccggatac gccttttttt cagggcgtag ccccagccaa 480

gctgctcccc gcggcggccg cacagcagcc cgagcgcccc ctttccagag ctcccctccg 540

gagctgggat ccaggcgcgt agcggagatc ccaggatcct gggtgctgtc tgggcccgct 600

ccccaccatg acctcctcgg ggcctggacc ccggttcctg ctgctgctgc cgctgctgct 660

gccccctgcg gcctcagcct ccgaccggcc ccggggccga gacccggtca acccagagaa 720

gctgctggtg atcactgtgg ccacagctga aaccgagggg tacctgcgtt tcctgcgctc 780

tgcggagttc ttcaactaca ctgtgcggac cctgggcctg ggagaggagt ggcgaggggg 840

tgatgtggct cgaacagttg gtggaggaca gaaggtccgg tggttaaaga aggaaatgga 900

gaaatacgct gaccgggagg atatgatcat catgtttgtg gatagctacg acgtgattct 960

ggccggcagc cccacagagc tgctgaagaa gttcgtccag agtggcagcc gcctgctctt 1020

ctctgcagag agcttctgct ggcccgagtg ggggctggcg gagcagtacc ctgaggtggg 1080

cacggggaag cgcttcctca attctggtgg attcatcggt tttgccacca ccatccacca 1140

aatcgtgcgc cagtggaagt acaaggatga tgacgacgac cagctgttct acacacggct 1200

ctacctggac ccaggactga gggagaaact cagccttaat ctggatcata agtctcggat 1260

ctttcagaac ctcaacgggg ctttagatga agtggtttta aagtttgatc ggaaccgtgt 1320

gcgtatccgg aacgtggcct acgacacgct ccccattgtg gtccatggaa acggtcccac 1380

taagctgcag ctcaactacc tgggaaacta cgtccccaat ggctggactc ctgagggagg 1440

ctgtggcttc tgcaaccagg accggaggac actcccgggg gggcagcctc ccccccgggt 1500

gtttctggcc gtgtttgtgg aacagcctac tccgtttctg ccccgcttcc tgcagcggct 1560

gctactcctg gactatcccc ccgacagggt cacccttttc ctgcacaaca acgaggtctt 1620

ccatgaaccc cacatcgctg actcctggcc gcagctccag gaccacttct cagctgtgaa 1680

gctcgtgggg ccggaggagg ctctgagccc aggcgaggcc agggacatgg ccatggacct 1740

gtgtcggcag gaccccgagt gtgagttcta cttcagcctg gacgccgacg ctgtcctcac 1800

caacctgcag accctgcgta tcctcattga ggagaacagg aaggtgatcg cccccatgct 1860

gtcccgccac ggcaagctgt ggtccaactt ctggggcgcc ctgagccccg atgagtacta 1920

cgcccgctcc gaggactacg tggagctggt gcagcggaag cgagtgggtg tgtggaatgt 1980

accatacatc tcccaggcct atgtgatccg gggtgatacc ctgcggatgg agctgcccca 2040

gagggatgtg ttctcgggca gtgacacaga cccggacatg gccttctgta agagctttcg 2100

agacaagggc atcttcctcc atctgagcaa tcagcatgaa tttggccggc tcctggccac 2160

ttccagatac gacacggagc acctgcaccc cgacctctgg cagatcttcg acaaccccgt 2220

cgactggaag gagcagtaca tccacgagaa ctacagccgg gccctggaag gggaaggaat 2280

cgtggagcag ccatgcccgg acgtgtactg gttcccactg ctgtcagaac aaatgtgtga 2340

tgagctggtg gcagagatgg agcactacgg ccagtggtca ggcggccggc atgagtggkr 2400

aggtacraat ggggccgggc ctcacatgaa aggggggccc ggccaggtga tcgggagttt 2460

ggccgacaac ctggcggcca caggcggcgg gaagggtccg gccaaaagtg gcccaaacca 2520

ccagcacagg gcaaggggcc agaccgggac gccaaaaggg gacacggggc 2570

<210> 34

<211> 1683

<212> DNA

<213> Homo sapien

<400> 34

gcattccagc ctgggtgaca gagcaagact ccgtctcaaa aaccaaaaac aagaaacaaa 60

aaagaagtgc cacagcttgg ggtaggaact gaccgcattt tgggaggtgg ggtggttggg 120

accagcagtg cttgacaagg gacatgcagg ggccttatgt tcagggactg ggataccctg 180

agggaagaag gacttcccgg gatgagctgg gcagaatttg gggggctgtc ctcccagctg 240

accgcggtgt cggtgcctcc agggggggtg tgtggaatgt accatacatc tcccaggcct 300

atgtgatccg gggtgatacc ctgcggatgg agctgcccca gagggatgtg ttctcgggca 360

gtgacacaga cccggacatg gccttctgta agagctttcg agacaaggtg agcgcgggtg 420

cacggtctgg cctgggggca gtcccccaga ctccaggcat cgcctgcatc acggacaccc 480

ccaccccact acaggcatct tcctccatct gagcaatcag catgaatttg gccggctcct 540

ggccacttcc agatacgaca cggagcacct gcaccccgac ctctggcaga tcttcgacaa 600

ccccgtcgtg agtggggacc ccacgcccag agcacacagg ttccccgcct tgtgctaagg 660

aagacaccaa acgtgatggc tgctgccagc gcaggacact gagtgggagg gggcggtccc 720

ctgggggctg ggagcgaggg gggcacagat ctgatgtgcc ccccaccctc tcacaggact 780

ggaaggagca gtacatccac gagaactaca gccgggccct ggaaggggaa ggaatcgtgg 840

agcagccatg cccggacgtg tactggttcc cactgctgtc agaacaaatg tgtgatgagc 900

tggtggcaga gatggagcac tacggccagt ggtcaggcgg ccggcatgag gattcaaggc 960

tggctggagg ctacgagaat gtgcccaccg tggacatcca catgaagcag gtggggtacg 1020

aggaccagtg gctgcagctg ctgcggacgt atgtgggccc catgaccgag agcctgtttc 1080

ccggttacca caccaaggcg cgggcggtga tgaactttgt ggttcgctac cggccagacg 1140

agcagccgtc tctgcggcca caccacgact catccacctt caccctcaac gttgccctca 1200

accacaaggg cctggactat gagggaggtg gctgccgctt cctgcgctac gactgtgtga 1260

tctcctcccc gaggaagggc tgggcactcc tgcaccccgg ccgcctcacc cactaccacg 1320

aggggctgcc aacgacctgg ggcacacgct acatcatggt gtcctttgtc gacccctgac 1380

actcaaccac tctgccaaac ctgccctgcc attgtgcctt tttagggggc ctggcccccg 1440

tcctgggagt tgggggatgg gtctctctgt ctccccactt cctgagttca tgttccgcgt 1500

gcctgaactg aatatgtcac cttgctccca agacacggcc ctctcaggaa gctcccggag 1560

tccccgcctc tctcctccgc ccacaggggt tcgtgggcac agggcttctg gggactcccc 1620

gcgtgataaa ttattaatgt tccgcagtct cactctgaat aaaggacagt ttgtaagtct 1680

tga 1683



<210> 35 

<211> 2040 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature






<222> (2040)..(2040)
<223> n=a, c, g or t 



<400> 35 
gcggccgcag ctcagagccg tggcctctcc gaagtggcag atggggcggg cgcggccaga 60

gcaagtgcca ggcgggaaca gagggactgg gcgcgcctca caactcacca cctcgcccgc 120

tggtccttcc tggctcgcct ggctctgaag ctgcacctgg aggggaaacc tcagaacagt 180

aggcgggatt gcctagtaaa tatctcccat tcagggaggc ccaggtcgtg tgacgtcgac 240

agttgctggg tagatgaggc caacacaggt tgcaagaaga ggcggggttt agaggcgtga 300

aactccgcag tgctcagcca agcagggagc aacgctagga agggcgggca gaaagggcac 360

gctcttgtgg gtgactacag gttaggagac cgttgaacct ggaggggccc taggatggac 420

cccgtggaaa gattcagaga ctgcgccctc tccctggcgc cgccttcccc tacacgcggc 480

gggtatattc tgttgcagtt ggcccaggac ctgtttccaa gactctgccc cctcgcactt 540

ccgtccctcc tggttttgta aagtgatgct cataggaacc cccaccccgc gtgacactac 600

tcccagctcc tggctgactt ctagtcttct ggttgaagct gcgcctttag atgacacgac 660

cctacccacc cctgtttcca gcggatgccc gggcctggag tacctcttac tgtaacccat 720

cgccaagtgg gcttttgaag gcgcctgttc ctttctcgct ttcttcggaa gacccttgac 780

ccatcattcc cccgaccccc ataacgggag agcagagaag ccggtcccca gtgtgatggt 840

cctggtccag gcactaactg tccttttctc ggaaaaggca gggggatgtg gaaaagagtc 900

ttgttccctc cccttcgatc tgtggctttc gctttcactt cctcctccga gagcggacag 960

atctctgggt gctgggcggt catggcgcta ctagatgtat gcggagcccc cgagggcagc 1020

ggccggaatc ggctctcccg gttgcgggaa gcgggcgtcg ctcggaccca ggacactaca 1080

gtttctctat gcgatctcca gagctcgctt taccccgggg aatgcagccc acagaattct 1140

tccagtccct gggtggggac ggagaaagga acgttcagat tgagatggcc catggcacca 1200

ccacgctcgc cttcaagttc cagcatggag tgattgcagc agtggattct cgggcctcag 1260

ctgggtccta cattagtgcc ttacgggtga acaaggtgat tgagattaac ccttacctgc 1320

ttggcaccat gtctggctgt gcagcagact gtcagtactg ggagcgcctg ctggccaagg 1380

aatgcaggct gtactatctg cgaaatggag aacgtatttc agtgtcggca gcctccaagc 1440

tgctgtccaa catgatgtgc cagtaccggg gcatgggcct ctctatgggc agtatgatct 1500

gtggctggga taagaagggt cctggactct actacgtgga tgaacatggg actcggctct 1560

caggaaatat gttctccacg ggtagtggga acacttatgc ctacggggtc atggacagtg 1620

gctatcggcc taatcttagc cctgaagagg cctatgacct tggccgcagg gctattgctt 1680

atgccactca cagagacagc tattctggag gcgttgtcaa tatgtccact gtcagcagct 1740

ggtcgcgggc cacttgaacg tcccatgtct tgccccgtgt gcccttccgg gagcgcctcc 1800

attgcggccg gcgccggggg gcgtcccggg aggacatgcg gccctccggg gcacacactt 1860

agtcctggga aaaaaggcgt cgtgtgccaa caaaaggtcg tctccacaag cgggaagcat 1920

atgggtcgtt ataaatcacg cgtgtgtgcc gccaaaatga taataatagg aggaggctcg 1980

acgccctatc catataacgg ctcacgcgtc catacttctt attactaatt tattcttcgn 2040



<210> 36 

<211> 1912 

<212> DNA 

<213> Homo sapien 

<400> 36 



gcggccgcag ctcagagccg tggcctctcc gaagtggcag atggggcggg cgcggccaga 60

gcaagtgcca ggcgggaaca gagggactgg gcgcgcctca caactcacca cctcgcccgc 120

tggtccttcc tggctcgcct ggctctgaag ctgcacctgg aggggaaacc tcagaacagt 180

aggcgggatt gcctagtaaa tatctcccat tcagggaggc ccaggtcgtg tgacgtcgac 240

agttgctggg tagatgaggc caacacaggt tgcaagaaga ggcggggttt agaggcgtga 300

aactccgcag tgctcagcca agcagggagc aacgctagga agggcgggca gaaagggcac 360

gctcttgtgg gtgactacag gttaggagac cgttgaacct ggaggggccc taggatggac 420

cccgtggaaa gattcagaga ctgcgccctc tccctggcgc cgccttcccc tacacgcggc 480

gggtatattc tgttgcagtt ggcccaggac ctgtttccaa gactctgccc cctcgcactt 540

ccgtccctcc tggttttgta aagtgatgct cataggaacc cccaccccgc gtgacactac 600

tcccagctcc tggctgactt ctagtcttct ggttgaagct gcgcctttag atgacacgac 660

cctacccacc cctgtttcca gcggatgccc gggcctggag tacctcttac tgtaacccat 720

cgccaagtgg gcttttgaag gcgcctgttc ctttctcgct ttcttcggaa gacccttgac 780

ccatcattcc cccgaccccc ataacgggag agcagagaag ccggtcccca gtgtgatggt 840

cctggtccag gcactaactg tccttttctc ggaaaaggca gggggatgtg gaaaagagtc 900

ttgttccctc cccttcgatc tgtggctttc gctttcactt cctcctccga gagcggacag 960

atctctgggt gctgggcggt catggcgcta ctagatgtat gcggagcccc cgagggcagc 1020

ggccggaatc ggctctcccg gttgcgggaa gcgggcgtcg ctcggaccca ggacactaca 1080

gtttctctat gcgatctcca gagctcgctt taccccgggg aatgcagccc acagaattct 1140

tccagtccct gggtggggac ggagaaagga acgttcagat tgagatggcc catggcacca 1200

ccacgctcgc cttcaagttc cagcatggag tgattgcagc agtggattct cgggcctcag 1260

ctgggtccta cattagtgcc ttacgggtga acaaggtgat tgagattaac ccttacctgc 1320

ttggcaccat gtctggctgt gcagcagact gtcagtactg ggagcgcctg ctggccaagg 1380 

aatgcaggct gtactatctg cgaaatggag aacgtatttc agtgtcggca gcctccaagc 1440 

tgctgtccaa catgatgtgc cagtaccggg gcatgggcct ctctatgggc agtatgatct 1500 

gtggctggga taagaagggt cctggactct actacgtgga tgaacatggg actcggctct 1560 

caggaaatat gttctccacg ggtagtggga acacttatgc ctacggggtc atggacagtg 1620 

gctatcggcc taatcttagc cctgaagagg cctatgacct tggccgcagg gctattgctt 1680 

atgccactca cagagacagc tattctggag gcgttgtcaa tatccctgag gaatggtggt 1740

gacagtgaac ggcccttcgg ccgcgccagt cgggaccacc aagggggggg cgggcgtccc 1800 

cggggtggga gcggcacacc tttccgaaaa agtcgcaagg tgtccagagg gggttgttta 1860

aaccagtggc cgaagaaagg gcccgctgcg tactggcgtt cttcaactta tt 1912



<210> 37 

<211> 1797 

<212> DNA 

<213> Homo sapien 

<400> 37 

gcggccgcag ctcagagccg tggcctctcc gaagtggcag atggggcggg cgcggccaga 60 

gcaagtgcca ggcgggaaca gagggactgg gcgcgcctca caactcacca cctcgcccgc 120 

tggtccttcc tggctcgcct ggctctgaag ctgcacctgg aggggaaacc tcagaacagt 180

aggcgggatt gcctagtaaa tatctcccat tcagggaggc ccaggtcgtg tgacgtcgac 240 

agttgctggg tagatgaggc caacacaggt tgcaagaaga ggcggggttt agaggcgtga 300 

aactccgcag tgctcagcca agcagggagc aacgctagga agggcgggca gaaagggcac 360 

gctcttgtgg gtgactacag gttaggagac cgttgaacct ggaggggccc taggatggac 420 

cccgtggaaa gattcagaga ctgcgccctc tccctggcgc cgccttcccc tacacgcggc 480 

gggtatattc tgttgcagtt ggcccaggac ctgtttccaa gactctgccc cctcgcactt 540 

ccgtccctcc tggttttgta aagtgatgct cataggaacc cccaccccgc gtgacactac 600 

tcccagctcc tggctgactt ctagtcttct ggttgaagct gcgcctttag atgacacgac 660 

cctacccacc cctgtttcca gcggatgccc gggcctggag tacctcttac tgtaacccat 720 

cgccaagtgg gcttttgaag gcgcctgttc ctttctcgct ttcttcggaa gacccttgac 780 

ccatcattcc cccgaccccc ataacgggag agcagagaag ccggtcccca gtgtgatggt 840 

cctggtccag gcactaactg tccttttctc ggaaaaggca gggggatgtg gaaaagagtc 900 

ttgttccctc cccttcgatc tgtggctttc gctttcactt cctcctccga gagcggacag 960

atctctgggt gctgggcggt catggcgcta ctagatgtat gcggagcccc cgagggcagc 1020

ggccggaatc ggctctcccg gttgcgggaa gcgggcgtcg ctcggaccca ggacactaca 1080

gtttctctat gcgatctcca gagctcgctt taccccgggg aatgcagccc acagaattct 1140 

tccagtccct gggtggggac ggagaaagga acgttcagat tgagatggcc catggcacca 1200 

ccacgctcgc cttcaagttc cagcatggag tgattgcagc agtggattct cgggcctcag 1260 

ctgggtccta cattagtgcc ttacgggtga acaaggtgat tgagattaac ccttacctgc 1320 

ttggcaccat gtctggctgt gcagcagact gtcagtactg ggagcgcctg ctggccaagg 1380 

aatgcaggct gtactatctg cgaaatggag aacgtatttc agtgtcggca gcctccaagc 1440 

tgctgtccaa catgatgtgc cagtaccggg gcatgggcct ctctatgggc agtatgatct 1500 

gtggctggga taagaagggt cctggactct actacgtgga tgaacatggg actcggctct 1560 

caggaaatat gttctccacg ggtagtggga acacttatgc ctacggggtc atggacagtg 1620 

gctatcggcc taatcttagc cctgaagagg cctatgacct tggccgcagg gctattgctt 1680 

atgccactca cagagacagc tattctggag gcgttgtcaa tatgtaccac attgarcgag 174 0 

atggtkggtg aaagtaaagt cgatgtggac ctgtgcccgc cggaagccca aagtggg 1797 

<210> 38 

<211> 1713 

<212> DNA 

<213> Homo sapien 

<400> 38 

ttaactaggt cactgtagta cctctccagc aggcacctgg gcactgccct ccattgtccc 60 

cactcctgca gcacttgcca ggacccagtc cccagtcccc agtccccagg gcttggtggt 120 

acttgcagtg tcttgtgatg ctttgaagac accagctttg aagcaaaaag ttctaggtct 180 

gctttccact taagccccat cttaactgta tgaccttgga catgtcacta gacttgcttt 240 

agtgtctcca tctgtaaaat ggggataaag atggactctt ctctcccaaa tgcccagcat 300 

gatgtgcagg gcctgtcccc ccaggagaac ctgtctcacc atccttcccc acttcctccc 360 

acccccaggt gtgccagaaa tcaggcgaga tctccctgct gaagcagcag ctgaaagagt 420 

ctcaggcaga gctggtgcag aagggcagcg agctggtggc tctgcgggtg gcgctgcggg 480 

aggcccgtgc tacgctgcgg gtcagtgagg gccgtgcgcg gggtctacag gaggccgccc 540

gagctcggga gctggagctg gaagcctgtt cccaggagct gcagcgacac cgccaggaag 600 

ctgagcagct gcgggagaaa gctgggcagt tggatgctga ggcggccgga ctccgggagc 660 

cccctgtgcc acctgccacc gctgacccat tcctcctggc agagagtgat gaggccaaag 720 

tgcagcgggc agcagccggg gttgggggca gcttgcgggc ccaggtggag cgattgcggg 780

tggagctgca gcgggagcgg cggcggggtg aggagcagcg ggacagcttt gagggggagc 840

ggctggcctg gcaggcagag aaggagcagg tgatccgcta ccagaagcag ctgcagcaca 900

actacatcca gatgtaccgg cgcaaccggc agctagagca ggagctgcag cagctcagcc 960

tggagctgga ggcccgggag ctcgctgacc tgggcctggc cgagcaggcc ccctgcatct 1020

gcctggagga gatcactgct actgagatct agggccctca gcaaccagct ctgtagggag 1080

ctctgccaga ggggcagcag ctgcagatcc acttaggccc cagggtccac ggatggcccc 1140

aaaggctgag ggccccaaag ccacttgtct cctaggatcc aggcctctgg gcttctgcca 1200

agaactcagg gtggccctat gacttggagg agcaagatca gaccgctcaa aggtccccgt 1260

gttcactgtt acccagaggc tcttgttact acccacttca ttccccaccg ctgccagtgc 1320

cactgccaac cctgttcaca ggcgcttcca gcccactcca gccaggggag cagggaagaa 1380

gaaggggctc cctcctcttc acattccccc cgaccccaaa gccagagaaa gccagatggc 1440

accagctgct ccggatgtgc ctgcccacat tgggggacag ggccgggcct gggctcggtt 1500

cccaggtttg agctctgcag cctctctcct ggagtgaggg ggctgaagtc agaccaaagg 1560

aagaactcag aaatgtcttg tttatttgtg tttgtgacca agcagcctct cccttcaccc 1620

aggtttatgg cctcgttttc acttgtatat ttttcacact gtaaatttct tgtacaaacc 1680

caaagaaaaa attaaaaaaa atttttttgt tta 1713


<210> 39 

<211> 2372 

<212> DNA 

<213> Homo sapien 












<400> 39 
gcggccgccg ccgcgccgtg gccctttgtg ctggaacaat gaccagctgc cgctggcccc 60

ggggccccgg ctggggttga gggtgaccac cgccgccgcc gcgtcgagcc gaaccgagcc 120

cccagacacg cagtcacagg gcgtcggagg gagggctccc gactgccgcc gagcagcccc 180

tacgtgcgcc ccggcggatt cgcgtgacgc gggtgcttgc gcgtgtccgc gccccggggc 240

ggagacgccc ctcggaccct cccctgcgcg gtgccgccca ctcgcaccgg gcgggggctg 300

gccagggccg ggggcgccgc ggaggcggag gcggggctgg gccgggcagc tccgagtggc 360

ggcgcgcggg ccagggccgg ggccggggcc ggggccggag ccggagccgg ggccgggcgg 420

cgcgcctgtg gagcgctggg ggcggcctgg ggctgagcat ggagcagcgc ggcggaggtg 480

aggggggccg ggtctgggag gcggcctggg gagcccccaa cttmtgcgcg gtggaatcca 540

gccgggggga cggcgttccc cagtcgcccc cactttctgg aaggggggct ggactctgga 600

gaatgggtca ggggctcctg gcttcccgcc gcagaatggg aactcgtgat cgcagacttg 660

accccgcttg actggtcctt ggagtttggg tgggagggag actttcccca actcttcaga 720

tccgtctccg gccctgaacc agctccagcc ctctgctctc ctcagcccac taaaaggggg 780

gtgctcctgg gccagagtga ttgctcaggg agtgctgggg agaatgaggg gttcccatct 840 

cccacagcca caaacgcatt tctcaaaatg gagcctccac ttgcccctcc ttctctttcg 900 

ggctccaaga gcgggagccg gcctggctgg gcctggcttt ccagactcct ggctctggct 960 

ccctggatgt gtggctctca gggcccggct ctgtggcact ttggacttat gggccctcgg 1020 

ctccgtgggg ttggggggcc ctggctcact gggtcccgct ggctgtcccc ttgtcggcct 1080

gaccctcggg cctctccgag tctttgtctc ttttgggtcc ctgtctttct ctgagcctgt 1140 

ctccttcctt caattttcac tttctccgtg ctcgctgaag tggggaaatg caacagaggt 1200 

gccagccccc ttggccccct aagaagtccc ttccccactg ttcactccac accctggagt 1260

gggcccagag aggggagggg agggggttct ctatctgctc tgctgggggc tggggtgtgt 1320 

ggggtggtct gagccccagc cctacagagg ggcctgtggt cactccctgc tcctggccta 1380 

cccctccccc agctccccgc agctcctgcc ccagcgtgga cacgtggagc gattgcgggt 1440 

ggagctgcag cgggagcggc ggcggggtga ggagcagcgg gacagctttg agggggagcg 1500 

gctggcctgg caggcagaga aggagcaggt gatccgctac cagaagcagc tgcagcacaa 1560 
ctacatccag atgtaccggc gcaaccggca gctagagcag gagctgcagc agctcagcct 1620

ggagctggag gcccgggagc tcgctgacct gggcctggcc gagcaggccc cctgcatctg 1680

cctggaggag atcactgcta ctgagatcta gggccctcag caaccagctc tgtagggagc 1740

tctgccagag gggcagcagc tgcagatcca cttaggcccc agggtccacg gatggcccca 1800

aaggctgagg gccccaaagc cacttgtctc ctaggatcca ggcctctggg cttctgccaa 1860

gaactcaggg tggccctatg acttggagga gcaagatcag accgctcaaa ggtccccgtg 1920

ttcactgtta cccagaggct cttgttacta cccacttcat tccccaccgc tgccagtgcc 1980

actgccaacc ctgttcacag gcgcttccag cccactccag ccaggggagc agggaagaag 2040

aaggggctcc ctcctcttca cattcccccc gaccccaaag ccagagaaag ccagatggca 2100

ccagctgctc cggatgtgcc tgcccacatt gggggacagg gccgggcctg ggctcggttc 2160

ccaggtttga gctctgcagc ctctctcctg gagtgagggg gctgaagtca gaccaaagga 2220

agaactcaga aatgtcttgt ttatttgtgt ttgtgaccaa gcagcctctc ccttcaccca 2280

ggtttatggc ctcgttttca cttgtatatt tttcacactg taaatttctt gtacaaaccc 2340

aaagaaaaaa ttaaaaaaaa tttttttgtt ta 2372

<210> 40

<211> 5075

<212> DNA

<213> Homo sapien 

<400> 40 



gcggccgccg ccgcgccgtg gccctttgtg ctggaacaat gaccagctgc cgctggcccc 60

ggggccccgg ctggggttga gggtgaccac cgccgccgcc gcgtcgagcc gaaccgagcc 120

cccagacacg cagtcacagg gcgtcggagg gagggctccc gactgccgcc gagcagcccc 180

tacgtgcgcc ccggcggatt cgcgtgacgc gggtgcttgc gcgtgtccgc gccccggggc 240

ggagacgccc ctcggaccct cccctgcgcg gtgccgccca ctcgcaccgg gcgggggctg 300

gccagggccg ggggcgccgc ggaggcggag gcggggctgg gccgggcagc tccgagtggc 360

ggcgcgcggg ccagggccgg ggccggggcc ggggccggag ccggagccgg ggccgggcgg 420

cgcgcctgtg gagcgctggg ggcggcctgg ggctgagcat ggagcagcgc ggcggaggtg 480

aggggggccg ggtctgggag gcggcctggg gagcccccaa cttmtgcgcg gtggaatcca 540

gccgggggga cggcgttccc cagtcgcccc cactttctgg aaggggggct ggactctgga 600

gaatgggtca ggggctcctg gcttcccgcc gcagaatggg aactcgtgat cgcagacttg 660

accccgcttg actggtcctt ggagtttggg tgggagggag actttcccca actcttcaga 720

tccgtctccg gccctgaacc agctccagcc ctctgctctc ctcagcccac taaaaggggg 780

gtgctcctgg gccagagtga ttgctcaggg agtgctgggg agaatgaggg gttcccatct 840

cccacagcca caaacgcatt tctcaaaatg gagcctccac ttgcccctcc ttctctttcg 900

ggctccaaga gcgggagccg gcctggctgg gcctggcttt ccagactcct ggctctggct 960

ccctggatgt gtggctctca gggcccggct ctgtggcact ttggacttat gggccctcgg 1020

ctccgtgggg ttggggggcc ctggctcact gggtcccgct ggctgtcccc ttgtcggcct 1080

gaccctcggg cctctccgag tctttgtctc ttttgggtcc ctgtctttct ctgagcctgt 1140

ctccttcctt caattttcac tttctccgtg ctcgctgaag tggggaaatg caacagaggt 1200

gccagccccc ttggccccct aagaagtccc ttccccactg ttcactccac accctggagt 1260

gggcccagag aggggagggg agggggttct ctatctgctc tgctgggggc tggggtgtgt 1320

ggggtggtct gagccccagc cctacagagg ggcctgtggt cactccctgc tcctggccta 1380

cccctccccc agctccccgc agctcctgcc ccagcgtgga cacctcttcc tggttcgcag 1440

tgtcttttga cttgggtggg aaggagggcg gtggatttaa tggagggcat gcgagaaaga 1500

aggggaacag atcaggcaaa ggagttagat aatgtcccac ctctccccac acgccgaaga 1560

gagaaggaaa caggctcccc ccaccaatgg gccctcaagc cacccaggct cccaccactc 1620

tcatacctgc gtgcccacgt tcacaggctc cactgcacca ctcaccttcc tagtcacaca 1680

caaacacagg agctcacaca taaactcatg aacttactga caagtctgtc caggcaccgt 1740

ctctcccaca caccacaaat tcacttgtta gatccacatt tataactttc acaggctttc 1800

taatgacccc tgatccacat actcttctaa gatatacaca tgtgaattca cacatgaact 1860 

catatgtatg tgaatgaact gatatatcaa acatgcatag actcacatga tgttttcaca 1920 

cacattttca tacacacgtg ttcattttca tgctcctgtg catgtagtca cacaacaaat 1980 

aactaaatgc acacactcct actcccagtg caattctctg tccacttctt ggccccgctc 2040 

tgcatgatga cttcagcaga tgcctaagtc ctggtcatca gcggcctcgg gactgggact 2100 

agggaggggg ttccgggtgg aggcttgcag ctgcttccag gaaggagcag agcctgggga 2160 

ggtggagccc tcttgcctgg gctctgtgga agtgaggttt tggagcagcc tccagaggtg 2220

aaggcagggc atctaccttc ctcccctcag ggttagtttg gggaccttct gaagctgtag 2280 

ggtttcagag aggttggcag tggggcagag cctgtggagg ctccttcagg ccttcaggcc 2340 

tcaccgattg ggtgtgcaga ggctaggagt tgtcatggaa actggtatga gcctgcatcc 2400 

acctctggga ggcaggagac cccaaaggac acctgtggac tccctaagag ccgcgggttt 2460 

cttatttagc cagatcgaca gagaggcagt ttagccgcct ttctgtctga gagaaactga 2520 

ggcccagcag agggaggcag ggcactgcct ggccttggct ctttgttccc taattaccat 2580

ctgaggtgct gattgcaccc ctgaggtacc tctgcccaaa ctgccctgta atgaggccac 2640 

ctggctgaga ggaccaggga cctgtgcttc aagctggtag agaaggcccc tccctcacct 2700 

gagtcctggc ctctgtggag gatgcaggat tgggcttggg cctgggtctt tgctgccagg 2760 

tggccctgcg gtaatcatcc aggcctccca tgcctctcag catctctgct gagtgaattc 2820 

actcctcagc cctgttcaga ccagcattgg gcagtgtggt ttggggccca gggggaattg 2880 

agattcattc cagcacttga agggggagtc ttggtgggga tcagggcttg ttactgatgt 2940 

caaggaggtg tttggtcagg atcagtgggt caggaggggt ctcaaggtgg agggactctg 3000 

acaccactgc cctgcttaca ggttgcctgc gaggccgctg gccaggcctg agcctctgcc 3060 

accatggcca ttgtgcagac tctgccagtg ccactggagc ctgctcctga agctgccact 3120 

gccccacaag ctccagtcat gggtagtgtg agcagcctta tctcaggccg gccctgtccc 3180 

ggggggccag ctcctccccg ccaccacggc cctcctgggc ccaccttctt ccgccagcag 3240

gatggcctgc tacggggtgg ctatgaggca caggagccgc tgtgcccagc tgtgccccct 3300 

aggaaggctg tccctgtcac cagcttcacc tacatcaatg aggacttccg gacagagtca 3360 

ccccccagcc caagcagtga tgttgaggat gcccgagagc agcgggcaca caatgcccac 3420 

ctccgcggcc caccaccaaa gctcatccct gtctctggaa agctggagaa gaacatggag 3480 

aagatcctga tccgcccaac agccttcaag ccagtgctgc ccaaacctcg aggggctccg 3540

tccctgccta gcttcatggg tcctcgggcc accgggctgt ctgggagcca gggcagcctg 3600

acgcagctgt ttgggggccc tgcctcctcc tcctcctctt cctcctcctc ttcagctgct 3660 

gacaaacccc tggcatttag tggctgggcc agtggctgcc catcagggac gctatccgac 3720

tctggccgaa actcactgtc cagcctgccc acctacagca ccggaggtgc cgagccaacc 3780 

accagctccc caggcgggca cctgccttcc catggctctg ggcgaggggc actgcctggg 3840 

ccagcccgag gggtccctac tgggccctcc cactcagaca gtggccggtc ctcctccagc 3900

aagagcacag gctccctagg gggccgtgtg gctggggggc ttttgggcag tggtactcgg 3960 

gcctcccctg acagcagctc ctgtggggag cgctcaccac cacccccgcc tccacctcct 4020 

tcggatgagg ccctgctgca ctgtgtcctg gaaggaaagc tccgagaccg ggaggcagag 4080 

cttcagcagc tgcgggacag tctggacgag aatgaggcta ccatgtgcca ggcatacgag 4140 

gagcggcagc ggcactggca gcgagagcgt gaggccctgc gagaggactg tgcggcccag 4200 

gcacagcggg cacagcgggc ccaacagctg ctgcagctgc aggtgttcca gctgcagcag 4260 

gagaagcggc aattgcagga cgacttcgca cagctgctgc aggagcgcga acagctggag 4320 

cggrrgctgcg ccaccttgga gcgggagcgg cggcggggtg aggagcagcg ggacagcttt 4380 

gagggggagc gggatcactg ctactgagat ctagggccct cagcaaccag ctctgtaggg 4440

agctctgcca gaggggcagc agctgcagat ccacttaggc cccagggtcc acggatggcc 4500 

ccaaaggctg agggccccaa agccacttgt ctcctaggat ccaggcctct gggcttctgc 4560 

caagaactca gggtggccct atgacttgga ggagcaagat cagaccgctc aaaggtcccc 4620 

gtgttcactg ttacccagag gctcttgtta ctacccactt cattccccac cgctgccagt 4680 

gccactgcca accctgttca caggcgcttc cagcccactc cagccagggg agcagggaag 4740 

aagaaggggc tccctcctct tcacattccc cccgacccca aagccagaga aagccagatg 4800 

gcaccagctg ctccggatgt gcctgcccac attgggggac agggccgggc ctgggctcgg 4860 

ttcccaggtt tgagctctgc agcctctctc ctggagtgag ggggctgaag tcagaccaaa 4920 

ggaagaactc agaaatgtct tgtttatttg tgtttgtgac caagcagcct ctcccttcac 4980 

ccaggtttat ggcctcgttt tcacttgtat atttttcaca ctgtaaattt cttgtacaaa 5040 

cccaaagaaa aaattaaaaa aaattttttt gttta 5075

<210> 41

<211> 5062

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature
<222> (5062)..(5062)
<223> n=a, c, g or t



<400> 41 

agcggattga tccggcggcg cgggattggc cagtcccggg ccaacacggg tggcgggagt 60 

tttcagatgt gttggctgcg gccccgactt cagcggttgg gctaggctgt gctgactgtt 120 

tgggtggcgg accccgggag taaaacctgt tgtcgatccc tcagcttcca gcttgcggct 180 

tgctgagtgg ccaccttctt tccggtcccc ggagctgcgg ggaaagatgg aacggcacca 240 

gccgcggctg caccacccgg cccaaggctc agccgctggg actccctacc cttcctcagc 300

ctctctccgc ggctgccggg aaagcaagat gccgcgcagg aagggccccc aacaccctcc 360 

gccgcccagt ggccccgagg agcctgggga gaagcgcccc aagtttcatt taaatattag 420 

gacactgacg gatgatatgc tggacaaatt tgccagcata agaattccag ggagcaagaa 480

agagagacct ccacttccca acctgaagac tgcatttgca agcagtgatt gctcagcagc 540 

acctttagag atgatggaga actttccaaa gccactgtca gagaatgaac tcttagaact 600 

ttttgaaaaa atgatggaag atatgaattt aaatgaagat aaaaaggcac cattgcggga 660 

aaaggacttc agtatcaaaa aagaaatggt gatgcagtac attaatactg cttctaagac 720 

aggaagtctt aagagaagcc gacagatctc acctcaggaa ttcattcatg agctgaaaat 780 

ggggtctgca gatgagagac ttgtcacatg cctggagtct ctccgagtgt ctttgaccag 840 

caatcctgtg agttgggtgg aaagctttgg acatgaaggg cttggattat tattagacat 900 

tttggaaaaa ctgattagtg gaaaaatcca agaaaaagtt gtaaagaaaa atcaacataa 960 

agtcatacag tgtctaaaag ccctgatgaa tacgcagtat ggcttggaaa gaattatgag 1020 

tgaggagagg agcctttcct tattggccaa agccgtggat cccagacacc ccaatatgat 1080 

gacagatgtg gttaaacttc tctctgcggt atgcattgta ggggaagaaa gcatccttga 1140 

agaagtttta gaagctttaa cttcagctgg tgaagaaaaa aaaattgaca gatttttttg 1200 

tattgtggaa ggcctccggc acaattcagt tcaactgcaa gtagcttgta tgcagctcat 1260 

caatgccctg gttacatctc ctgatgattt ggatttcagg cttcacatca gaaatgaatt 1320

tatgcgttgt ggattgaaag agatattgcc aaatttaaaa tgcattaaga atgatggcct 1380 

ggatatccaa cttaaagtct ttgatgagca taaagaagaa gatttgtttg agttatccca 1440 

tcgccttgaa gatattagag ctgaacttga tgaagcatat gatgtttaca acatggtgtg 1500 

gagcacagtt aaagaaacta gagcagaggg atattttatt tctattcttc agcatctttt 1560 

gctgattcga aatgattatt ttataaggca acaatacttc aaattaattg atgagtgtgt 1620 

atcccagatt gtattgcata gagatggaat ggatccagac ttcacatatc gaaaaagact 1680 






agatttagat ttaacccagt ttgtagacat ttgcatagat caagcaaaac tagaagagtt 1740 

tgaagagaaa gcatcagaac tttacaagaa atttgaaaaa gagtttaccg accaccaaga 1800 

aactcaggct gaattgcaga aaaaagaggc aaagattaat gagcttcaag cagagctaca 1860 

agcttttaag tctcagtttg gtgccttgcc agctgattgt aatattcctt tgcctccctc 1920 

taaagaaggt ggaactggcc actcagcact tcctcctccg cctccactgc cttctggtgg 1980 

aggggtgccg cctccacctc ctcccccacc acctcctcca cttccaggaa tgcggatgcc 2040

attcagtggt cctgtgcctc caccacctcc cctgggattc cttggaggac aaaattctcc 2100 

tcctctacca atcctgccat ttgggttgaa accaaagaaa gaatttaaac ctgaaatcag 2160 

catgagaaga ttgaattggt taaagatcag acctcatgaa atgactgaaa actgtttctg 2220 

gataaaagta aatgaaaata agtatgaaaa cgtggatttg ctttgtaaac ttgagaatac 2280

attttgttgc caacaaaaag agagaagaga agaggaagat attgaagaga agaaatcgat 2340 

taagaaaaaa attaaagaac ttaagttttt agattctaaa attgcccaga acctttcaat 2400 

cttcctgagc tcttttcggg tgccatatga ggaaatcaga atgatgatat tggaagtaga 2460 

tgaaacacgg ttggcagagt ctatgattca gaacttaata aagcatcttc ctgatcaaga 2520 

gcaattaaat tcattgtctc agttcaagag tgaatatagc aacttatgtg aacctgagca 2580 

gtttgtggtt gtgatgagca atgtgaagag actacggcca cggctcagtg ctattctctt 2640

taagcttcag tttgaagagc aggtgaacaa catcaaacct gacatcatgg ctgtcagtac 2700 

tgcctgcgaa gagataaaga agagcaaaag ctttagcaag ttgctggaac ttgtattgct 2760 

aatgggaaac tacatgaatg ctggctcccg gaatgctcaa accttcggat ttaaccttag 2820 

ctctctctgt aaactaaagg acacaaaatc agcagatcag aaaacaacgc tacttcattt 2880 

cctggtagaa atatgtgaag agaagtaccc tgatatactg aattttgtgg atgatttgga 2940 

acctttagac aaagctagta aagtctctgt agaaacgctg gaaaagaatt tgaggcagat 3000 

gggaaggcag cttcaacagc ttgagaagga attggaaacc tttccccctc ctgaggactt 3060 

gcatgacaag tttgtgacaa agatgtccag atttgttatc agtgcaaaag aacaatatga 3120 

gacactttcg aagttacacg aaaacatgga aaagttatac cagagtataa taggatacta 3180 

tgccattgat gtgaagaagg tgtctgtgga agactttctt actgacctga ataacttcag 3240 

aaccacattc atgcaagcaa taaaggagaa tatcaaaaaa agagaagcag aggaaaaaga 3300

aaaacgtgtc agaatagcta aagaattagc agagcgagaa agactcgaac gccaacaaaa 3360 

gaaaaagcgt ttattagaaa tgaagactga gggtgatgag acaggagtga tggataatct 3420 

gctggaggcc ttgcagtccg gggctgcctt ccgcgacaga agaaaaagga caccgatgcc 3480 

aaaagatgtt cggcagagtc tcagtccaat gtctcagagg cctgttctga aagtttgtaa 3540

ccatgaaaat cagaaagtgc agttgacaga agggtcacgt tcacactaca atatcaattg 3600

caactcaaca aggactccag tcgccaagga gcttaattat aatctagaca ctcatacgtc 3660

tactgggagg atcaaggcag ctgagaagaa ggaagcgtgt aatgtagaaa gcaacagaaa 3720

aaaggaaacg gaacttcttg gctctttttc taaaaatgaa tcagttcccg aagttgaagc 3780

cctgctggca agattacgag ctttataagt taaactggtt tttaaaaaaa atgattaagc 3840

caaatataaa gccatgctct aaactataac acttgaaaaa attgctttta tgtaagtgac 3900

tttatatagt tttaaattat gatatatatt agaaaaataa agctaaatat atgaatgcaa 3960

tgctttttct atgtactgga ggttttggtt tattcaagat tgtaccgggc tttaatcctt 4020

tttttttttt ttttgtctcc tggtattcct cagttcttta tttggtggtt aaattataca 4080

tattgcttta gagagcaggt aggtggccat gtgttcagca gtgtgtcctt aagaaaatac 4140

catctttcta agccactgga atttttactt tactattttt aacattaatg gatgtcaggt 4200

catcaacctc aagtctttac atatccatgg tatattccat atatattgtt tatataggcc 4260

caagtttctc cttaattggg atctatatac taccagcaca acatcaaaaa catgtaattg 4320

aatacatcag agctatatat gtaaggaaat gactggtgac cccattatca tcattgttga 4380

attcatgtta agtagaccct ctaggggacc ataaggcaat tgagcacata acgaaaaatg 4440

atgcaataag aatgtatgca ctctctttgc caaatgcatg tgcttttgtg taacgtggat 4500

gtaaacagaa ttgcagtgct gccgaaattc ttgatcttgg ctaagagagt atttttcccc 4560

ttgtaattat gactctgaga taaaattgcc attttgaaat ttccaaagta acaacttttt 4620

ttattttatg aataaacttg ggattgcaat ttctctgatc tgacaatcaa taactttaac 4680

aaagatctaa ataagtgttt caaggaaagt tttcctaagc aaatgtaata ttacctcatt 4740

tgggcatcat tactctgtta attctatatc aaaggaaata aacttgctac ttgcactaaa 4800

tgaaaaaaca cgaacaaaag gaagaaccag aaaaaaacag aaaaaacaag aaaatacgaa 4860


aaaaaaaaar 


« y ci cl cl y ci cl \* 


CtCtCiyClClCL^jClCl 




ora a era. acraa a 


aacaaaaaaa 


4920 


aaagaggccc agatcataac ccgagagaaa acgagactcg agattagaga aagaagtaaa 4980

taaaccgcgg ggtaatgaga gaaagaagca gaagcacgag gagagaaaca ccaagagaga 5040

atagaaaaca tcaacaataa an 5062


<210> 42 

<211> 2672 

<212> DNA 

<213> Homo sapien 












<400> 42 
acgtccaccc cgaatccctg cttaaaggcc ttgctttctt gtctaacgcc gcaaccagtc 60

ctctgagttg ccaacgtctt tcttcttgtc tcgacgcccc gtcgtccggc cacagcgatt 120

ctctgcttag caggatcggt ccacagcggg acgtgagtcc ctttcctcct cgcggcttac 180 

cgcctctctc cgcctagtgc caggtgctaa taaagttgtt gtttcaaatg cggccaggaa 240 

catcgcgagc ggggaccaat cagagagtag ctttgcctct ataacggcgc gagagtgaga 300

cgtcatcggt gagcgactaa cgctagaaac agtggtgcgc ggagaggaga ggcctcggga 360 

tgtctctggc agatgagctc ttagctgatc tcgaagaggc agcagaagag gaggaaggag 420 

gaagctatgg ggaggaagaa gaggagccag cgatcgagga tgtgcaggag gagacacagc 480 

tggatctttc cggggattca gtcaagacca tcgccaagct atgggatagt aagatgtttg 540 

ctgagattat gatgaagatt gaggagtata tcagcaagca agccaaagct tcagaagtga 600 

tgggaccagt ggaggccgcg cctgaatacc gcgtcatcgt ggatgccaac aacctgaccg 660 

tggagatcga aaacgagctg aacatcatcc ataagttcat ccgggataag tactcaaaga 720 

gattccctga actggagtcc ttggtcccca atgcactgga ttacatccgc acggtcaagg 780 

agctgggcaa cagcctggac aagtgcaaga acaatgagaa cctgcagcag atcctcacca 840 

atgccaccat catggtcgtc agcgtcaccg cctccaccac ccaggggcag cagctgtcgg 900 

aggaggagct ggagcggctg gaggaggcct gcgacatggc gctggagctg aacgcctcca 960 

agcaccgcat ctacgagtat gtggagtccc ggatgtcctt catcgcaccc aacctgtcca 1020 

tcattatcgg ggcatccacg gccgccaaga tcatgggtgt ggccggcggc ctgaccaacc 1080

tctccaagat gcccgcctgc aacatcatgc tgctcggggc ccagcgcaag acgctgtcgg 1140 

gcttctcgtc tacctcagtg ctgccccaca ccggctacat ctaccacagt gacatcgtgc 1200

agtccctgcc accggatctg cggcggaaag cggcccggct ggtggccgcc aagtgcacac 1260 

tggcagcccg tgtggacagt ttccacgaga gcacagaagg gaaggtgggc tacgaactga 1320 

aggatgagat cgagcgcaaa ttcgacaagt ggcaggagcc gccgcctgtg aagcaggtga 1380 

agccgctgcc tgcgcccctg gatggacagc ggaagaagcg aggcggccgc aggtgagggg 1440 

ccctgggggt ccggtaggca tgggggtcat ggaggggaga agccggcgtc ctcctcccag 1500

ccgactccct ggcgccgccc acccacccgt ccccgtaccg caagatgaag gagcggctgg 1560 

ggctgacgga gatccggaag caggccaacc gtatgagctt cggagaggtc agactcccag 1620 

agcgccctcc tcaaccccac agccagccag ccgccaccgc cctctgcctc ctgccaccgc 1680 

ccctcctctc gtcctgtggc cctggctcat gtctagggcg ctgccccagc ctcccccccc 1740 

gccggcctct attctcgttt ccatccattc agccccaaag cgaccctcgc ggcccttgga 1800 

gcctgtgtct ccgctgctta gagcccccgc ggcttcccat cgccccgggc tccttggccg 1860

gttcctccct gcccagaggc tccttagtgc cctgctgcac ggccgccccg tccctgggcc 1920

cggccagtct cctctgttat cccagcgtca tccccttggt cctgcaggac cgaaactcag 1980 

aggccacctc atcctattaa acctgttctg gttcctgaca tcccccgacc cacacgaatc 2040 

gaggaggacg cctaccagga ggacctggga ttcagcctgg gccacctggg caagtcgggc 2100 

agtgggcgtg tgcggcagac acaggtaaac gaggccacca aggccaggat ctccaagacg 2160 

ctgcagcgga ccctgcagaa gcagagcgtc gtatatggcg ggaagtccac catccgcgac 2220 

cgctcctcgg gcacggcctc cagcgtggcc ttcaccccac tccagggcct ggagattgtg 2280 

aacccacagg cggcagagaa gaaggtggct gaggccaacc agaagtattt ctccagcatg 2340 

gctgagttcc tcaaggtcaa gggcgagaag agtggcctta tgtccacctg aatgactgcg 2400 

tgtgtccaag gtggcttccc actgaaggga cacagaggtc cagtccttct gaagggctag 2460 

gatcgggttc tggcagggag aacctgccct gccactggcc ccattgctgg gactgcccag 2520 

ggaggaggcc ttggaagagt ccggcctggc ctcccccagg accgagatca ccgcccagta 2580 

tgggctagag caggtcttca tcatgccttg tcttttttaa ctgagaaagg agattttttg 2640 

aaaagagtac aattaaaagg acattgtcaa ga 2672 

<210> 43 

<211> 2436 

<212> DNA 

<213> Homo sapien 

<400> 43 

cgccgcccct atgctgggac caggctccca gtgggcagcc ctggsctctt ccttaacgct 60 

gcagctgcga ctccacgtac tccctcagtg actggagcga cggagaccgt cacgccctca 120 

gaggccccgg tgctggcagc tgagcccgag gctgacaagg gcactgtgtt ggcactcact 180 

gaaaataact tcgatgacac cattgcagaa ggaataacct tcatcaagtt ttatgctcca 240 

tggtgtggtc attgtaagac tctggctcct acttgggagg aactctctaa aaaggaattc 300 

cctggtctgg cgggggtcaa gatcgccgaa gtagactgca ctgctgaacg gaatatctgc 360 

agcaagtatt cggtacgagg ctaccccacg ttattgcttt tccgaggagg gaagaaagtc 420 

agtgagcaca gtggaggcag agaccttgac tcgttacacc gctttgtcct gagccaagcg 480 

aaagacgaac tttaggaaca cagttggagg tcacctctcc tgcccagctc ccgcaccctg 540 

cgtttaggag ttcagtccca cagaggccac tgggttccca gtggtggctg ttcagaaagc 600 

agaacatact aagcgtgagg tatcttcttt gtgtgtgtgt tttccaagcc aacacactct 660

acagattctt tattaagtta agtttctcta agtaaatgtg taactcatgg tcactgtgta 720 

aacattttca gtggcgatat atcccctttg accttctctt gatgaaattt acatggtttc 780

ctttgagact aaaatagcgt tgagggaaat gaaattgctg gactatttgt ggctcctgag 840

ttgagtgatt ttggtgaaag aaagcacatc caaagcatag tttacctgcc cacgagttct 900

ggaaaggtgg ccttgtggca gtattgacgt tcctctgatc ttaaggtcac agttgactca 960

atactgtgtt ggtccgtagc atggagcaga ttgaaatgca aaaacccaca cctctggaag 1020

ataccttcac ggccgctgct ggagcttctg ttgctgtgaa tacttctctc agtgtgagag 1080

gttagccgtg atgaaagcag cgttacttct gaccgtgcct gagtaagaga atgctgatgc 1140

cataacttta tgtgtcgata cttgtcaaat cagttactgt tcaggggatc cttctgtttc 1200

tcacggggtg aaacatgtct ttagttcctc atgttaacac gaagccagag cccacatgaa 1260

ctgttggatg tcttccttag aaagggtagg catggaaaat tccacgaggc tcattctcag 1320

tatctcatta actcattgaa agattccagt tgtatttgtc acctggggtg acaagaccag 1380

acaggctttc ccaggcctgg gtatccaggg aggctctgca gccctgctga agggccctaa 1440

ctagagttct agagtttctg attctgtttc tcagtagtcc ttttagaggc ttgctatact 1500

tggtctgctt caaggaggtc gaccttctaa tgtatgaaga atgggatgca tttgatctca 1560

agaccaaaga cagatgtcag tgggctgctc tggccctggt gtgcacggct gtggcagctg 1620

ttgatgccag tgtcctctaa ctcatgctgt ccttgtgatt aaacacctct atctcccttg 1680

ggaataagca catacaggct taagctctaa gatagatagg tgtttgtcct tttaccatcg 1740

agctacttcc cataataacc actttgcatc caacactctt cacccacctc ccatacgcaa 1800

ggggatgtgg atacttggcc caaagtaact ggtggtagga atcttagaaa caagaccact 1860

tatactgtct gtctgaggca gaagataaca gcagcatctc gaccagcctc tgccttaaag 1920

gaaatcttta ttaatcacgt atggttcaca gataattctt tttttaaaaa aacccaacct 1980

cctagagaag cacaactgtc aagagtcttg tacacacaac ttcagctttg catcacgagt 2040

cttgtattcc aagaaaatca aagtggtaca atttgtttgt ttacactatg atactttcta 2100

aataaactct ttttttttaa aaaaaaaaaa aaaaaaaaaa aaaacgacga aaaaagaaaa 2160

aagggcggtc cgaatccccg ggccagtacg tcccgtgtgt aaaggggcca aaaggacgaa 2220

ttatagacag gcaggggcgc gttaacacgg ttgtgcggca aaacgttatg ggaatttgga 2280

agggaccctc ttggtgggtg acatattgac acatacctag gaattagctt aggataataa 2340

aatttagggt ttaatggtta acaagggatt ttctgggtta aaatttgctg tcgaaatttt 2400

tttgaaaatt tttttctgga cttttaaata ggggtt 2436

<210> 44

<211> 1560

<212> DNA

<213> Homo sapien

<400> 44

acggaccaac acagtattga gtcaactgtg accttaagat cagrggaacg tcaatactgc 60 

cacaaggcca cctttccaga actcgtgggc aggtawacta tgctttggag cagattgaaa 120 

tgcaaaaacc cacacctctg gaagatacct tcacggccgc tgctggagct tctgttgctg 180 

tgaatacttc tctcagtgtg agaggttagc cgtgatgaaa gcagcgttac ttctgaccgt 240 

gcctgagtaa gagaatgctg atgccataac tttatgtgtc gatacttgtc aaatcagtta 300 

ctgttcaggg gatccttctg tttctcacgg ggtgaaacat gtctttagtt cctcatgtta 360 

acacgaagcc agagcccaca tgaactgttg gatgtcttcc ttagaaaggg taggcatgga 420 

aaattccacg aggctcattc tcagtatctc attaactcat tgaaagattc cagttgtatt 480 

tgtcacctgg ggtgacaaga ccagacaggc tttcccaggc ctgggtatcc agggaggctc 540 

tgcagccctg ctgaagggcc ctaactagag ttctagagtt tctgattctg tttctcagta 600 

gtccttttag aggcttgcta tacttggtct gcttcaagga ggtcgacctt ctaatgtatg 660 

aagaatggga tgcatttgat ctcaagacca aagacagatg tcagtgggct gctctggccc 720

tggtgtgcac ggctgtggca gctgttgatg ccagtgtcct ctaactcatg ctgtccttgt 780 

gattaaacac ctctatctcc cttgggaata agcacataca ggcttaagct ctaagataga 840

taggtgtttg tccttttacc atcgagctac ttcccataat aaccactttg catccaacac 900 

tcttcaccca cctcccatac gcaaggggat gtggatactt ggcccaaagt aactggtggt 960 

aggaatctta gaaacaagac cacttatact gtctgtctga ggcagaagat aacagcagca 1020 

tctcgaccag cctctgcctt aaaggaaatc tttattaatc acgtatggtt cacagataat 1080 

tcttttttta aaaaaaccca acctcctaga gaagcacaac tgtcaagagt cttgtacaca 1140 

caacttcagc tttgcatcac gagtcttgta ttccaagaaa atcaaagtgg tacaatttgt 1200 

ttgtttacac tatgatactt tctaaataaa ctcttttttt ttaaaaaaaa aaaaaaaaaa 1260 

aaaaaaaacg acgaaaaaag aaaaaagggc ggtccgaatc cccgggccag tacgtcccgt 1320 

gtgtaaaggg gccaaaagga cgaattatag acaggcaggg gcgcgttaac acggttgtgc 1380 

ggcaaaacgt tatgggaatt tggaagggac cctcttggtg ggtgacatat tgacacatac 1440 

ctaggaatta gcttaggata ataaaattta gggtttaatg gttaacaagg gattttctgg 1500 

gttaaaattt gctgtcgaaa tttttttgaa aatttttttc tggactttta aataggggtt 1560

<210> 45

<211> 919

<212> DNA

<213> Homo sapien

<400> 45

tgatggtgac tctagagtat cccstacgag agaagagacg tcaagaccaa agacagatgt 60

cagtgggctg ctctggccct ggtgtgcacg gctgtggcag ctgttgatgc cagtgtcctc 120

taactcatgc tgtccttgtg attaaacacc tctatctccc ttgggaataa gcacatacag 180

gcttaagctc taagatagat aggtgtttgt ccttttacca tcgagctact tcccataata 240

accactttgc atccaacact cttcacccac ctcccatacg caaggggatg tggatacttg 300

gcccaaagta actggtggta ggaatcttag aaacaagacc acttatactg tctgtctgag 360

gcagaagata acagcagcat ctcgaccagc ctctgcctta aaggaaatct ttattaatca 420

cgtatggttc acagataatt ctttttttaa aaaaacccaa cctcctagag aagcacaact 480

gtcaagagtc ttgtacacac aacttcagct ttgcatcacg agtcttgtat tccaagaaaa 540

tcaaagtggt acaatttgtt tgtttacact atgatacttt ctaaataaac tctttttttt 600

taaaaaaaaa aaaaaaaaaa aaaaaaacga cgaaaaaaga aaaaagggcg gtccgaatcc 660

ccgggccagt acgtcccgtg tgtaaagggg ccaaaaggac gaattataga caggcagggg 720

cgcgttaaca cggttgtgcg gcaaaacgtt atgggaattt ggaagggacc ctcttggtgg 780

gtgacatatt gacacatacc taggaattag cttaggataa taaaatttag ggtttaatgg 840

ttaacaaggg attttctggg ttaaaatttg ctgtcgaaat ttttttgaaa atttttttct 900

ggacttttaa ataggggtt 919

<210> 46

<211> 618

<212> DNA

<213> Homo sapien

<400> 46

ccaggcgact ctagaggatc cgggctccac ttggttgggt tgcagccgga gccgcccagc 60

tcaccgagag cctagttccg gccagggtcg ccccggcaac cacgagccca gccaatcagc 120

gccccggact gcaccagagc catggtcggc agaagagcac tgatcgtact ggctcactca 180

gagaggacgt ccttcaacta tgccatgaag gaggctgctg cagcggcttt gaagaagaaa 240

ggatgggagg tggtggagtc ggacctctat gccatgaact tcaatcccat catttccaga 300

aaggacatca caggtaaact gaaggaccct gcgaactttc agtatcctgc cgagtctgtt 360

ctggcttata aagaaggcca tctgagccca gatattgtgg ctgaacaaaa gaagctggaa 420

gccgcagacc ttgtgatatt ccaggtatgg ggggacatcg gaaggggtgt cagggacatt 480

tgcgtgctta ttgtcctaga catgtaccta attagctatg agatcttaaa cagttaccca 540

ctttactgca ttctctgcaa ataagggtga ttactttaaa ggtgcagtat tacatggatt 600

tcacagatat tgggaatc 618

<210> 47

<211> 932 

<212> DNA 

<213> Homo sapien 

<400> 47 

ttcgagtcgg tacccgcaag gtgaatatga gacctgcaat agcgataatc tgaaccatgc 60 

ggttttggca gtgggatatg gaatccagaa gggaaacaag cactggataa ttaaaaacag 120

ctggggagaa aactggggaa acaaaggata tatcctcatg gctcgaaata agaacaacgc 180 

ctgtggcatt gccaacctgg ccagcttccc caagatgtga ctccagccag ccaaatccat 240 

cctgctcttc catttcttcc acgatggtgc agtgtaacga tgcactttgg aagggagttg 300 

gtgtgctatt tttgaagcag atgtggtgat actgagattg tctgttcagt ttccccattt 360 

gtttgtgctt caaatgatcc ttcctacttt gcttctctcc acccatgacc tttttcactg 420 

tggccatcag gactttccct gacagctgtg tactcttagg ctaagagatg tgactacagc 480 

ctgcccctga ctgtgttgtc ccagggctga tgctgtacag gtacaggctg gagattttca 540 

cataggttag attctcattc acgggactag ttagctttaa gcaccctaga ggactagggt 600 

aatctgactt ctcacttcct aagttccctt ctatatcctc aaggtagaaa tgtctatgtt 660 

ttctactcca attcataaat ctattcataa gtctttggta caagtttaca tgataaaaag 720 

aaatgtgatt tgtcttccct tctttgcact tttgaaataa agtatttatc tcctgtctac 780 

agtttaataa atagcatcta gtacacattc attttgtgtt ggatactgtg ttaggtgctg 840 

gaggaaaaaa gatgaataga acatcttcta tgtacttgat gcgctcacag tctggttgta 900 
gagactgtca cataaacatt tcatcccaat tc 932



<210> 48 

<211> 2994 

<212> DNA 

<213> Homo sapien 

<400> 48 

ttcgtctcgc tttgttctgg ttcttcctcg tgtcggtcgt tattgttctt cttccgcttc 60 

gttctcattc gttggtcctt tcccttgctc tcttcctcct tttcctcctc cctcttcgtg 120 

gttctcttcc ggcgctgtgg tcttgttcgt ttctctcggt ctgtcctggc ttccgtcttg 180 

ttggctcttc tccttctgtc ttcttgcgtc cggtttcctg ttgcttgtct gtctttctcg 240 

cttctgctcg tcatctgttt ctccttgttc ttgctttttc tctctccggt ctcgccttct 300 

ttccttgtct cctcctcccc ttttcttctc tttgcgtgtg cttgtctcgc tcgctctgtg 360 

ttcttctgtc tctgcttttg tcgtgtccgc ttgtctcttg tcttcttcgg cttgcttttt 420 

ctcttctctc ctctccgttc gctgctgttt tctgttctcc gtgcctctgt cccttttgtc 480

ttcttcgtct tcttcgcttc cttccgttcc ctccgctcct cttcctccgt tcctctcctc 540

tcgtcgttcc tcccgctctc ccctttcgct gctgctctgg cttccctcgc tggccgtcct 600

ccctcttcgc ctccctctcc ttccctctgt cgtttctcgt tgttgttctt gtgtgttgtt 660

gtgtgtgctc gtgttgttct ggttcttggt tggtggttgt gtggtgtgtg cgctgtgtgt 720

gttgtttgtt gtgtttgtgc gctcgtggtg tactgctgag aagtcacacc accagcgtac 780

gtcgtttaac cggctatcgt gggcgcatcc cccgagggtc tccgagccgg gcggtccggc 840

ggctgttccc gccttctttt ttttgctcct tgggccctgt ccaaacggtc ccgtacttgg 900

ccctcgaggg aactttggcg cccccttttt ttttttgcat gtccaccttc gcttttattg 960

agtagttagt gttctcaagc tggctcacac ccatgcggtc agctggggcc tagggtggct 1020

ctttgcaaag ctgaggggca agctaaggaa gccaggcagg tcaggggccc tttcggcctt 1080

ctcaagcctc cacctgagtt ctcgtcaatg ccagtctccc tggtatgatt ggggacatta 1140

tcagagaaac atctaatagc gcacatctgg gcacccacac tctgcttcag ttgcatccat 1200

cctcccaccc caaattcaac tcctgaccca atacaaaaga cttttttaac caggatttct 1260

tcttgcagga aagctgactt ggaaacacgg ggaggtggca gggaggggca aaaaggactc 1320

tggcaagcag atccacttgt ctgggtccct gcagtgaaga acccaagatc caggtacctc 1380

agcctggaag aaaccgtgca ctgcaggtct tcccttctat caaaactaaa agcaaggaac 1440

agaagtcaat tgaaacccca ggtcatccta cccaccttct ggcagtctct ggataatgat 1500

ctaaggaaat aagcttacat ttataattag aggataattt ggagaattta tactatttac 1560

acggaccacc ctgcaaagat cagggaaggt aagagactgc tgggcagaga atggaaacag 1620

gctggggggt tcctaatctc ttttcaccat tgcctttgtt gttcccaccc ttcagctcct 1680

ggcagggaca gcagaggaat cagatactct tggggaacac aaggtcccag cagtggctag 1740

ggtgtgactg ctacttttgc ataatcaggc agcagggagc tatgaggagc agaacagtcc 1800

ctgcctgctg tcctcactca gaggcttggg gtgcactgag ccttggagtg cccgggatgg 1860

tggacagctt gagcacaggg gcaagggcag ggctctactg tagctcagga ataaactggg 1920

accagttgat gttgtcaggg cccagtgggt cctcgtccag gccaggaaag ctgatgtcca 1980

gcaggatctt gctgaggctg tcattcattg tgtccaggac caggccttct gtcagagaac 2040

gattggctgc aaggccagaa acctgtggct ccggggagcc tggcttgggg acgtctatat 2100

ctgagggaga agagttgcca aaggggacgg agatgaggtc taagggttct gaactgagga 2160

gcctttgcgg tgattcaagg gggggagcac tttgcaaggg agtggtgctg agatccatca 2220

gccccagggg gtcaggcaag gggtcagagg caccctggga ggtttgtact gggctgaaat 2280

ccagtccccc tactttggct gggggcgtga gcctccagga ttcaggggtt ctggggagga 2340

cagatttgct cggggtggag gagatgggca gcgtttcctt aatgggtgtc ttaaaaggtc 2400

ctcccacttc ctgggagtag ctgagctggg aggcagggtc agaggagtct gctgggaacg 2460 

ggagctctgc ggcccagcgg gaagtactgg gcccctctga gaagagcagc tccggctcat 2520 

ccacacaggg aggcagtaga tgctgtttcc tccgagaccg gctcctctcc ctcctctccc 2580 

tgtgttgaat cacaagcatt tccgagacac accgggttgg ggacctaagc ccactgtagg 2640

acttcttggg tcttggggtg ggagattggg acgaatcctc ccaggagtga gatgattcct 2700 

ctttgaaaga tggggccggg gagggccact cttccaaggg agggctctcc actttgatgg 2760 

gtctcgctaa gtgtggcatt tcctccccag gctggatttc ttcctccttg atagtctgaa 2820 

ctggaagcaa aggagaaaac ccttctccaa acaggagttt ctcctctttc cctggtcctg 2880 

cagaagaaag aggagctatc ccctcctcag ctagcagcac cttgggggca atgcggactc 2940 

gcttgctatg gcgggcaagc tctgagctca tgagggaagc cgccaggggc aatg 2994 

<210> 49 

<211> 2053 

<212> DNA 

<213> Homo sapien 

<400> 49 

ttcgtctcgc tttgttctgg ttcttcctcg tgtcggtcgt tattgttctt cttccgcttc 60 

gttctcattc gttggtcctt tcccttgctc tcttcctcct tttcctcctc cctcttcgtg 120 

gttctcttcc ggcgctgtgg tcttgttcgt ttctctcggt ctgtcctggc ttccgtcttg 180 

ttggctcttc tccttctgtc ttcttgcgtc cggtttcctg ttgcttgtct gtctttctcg 240 

cttctgctcg tcatctgttt ctccttgttc ttgctttttc tctctccggt ctcgccttct 300

ttccttgtct cctcctcccc ttttcttctc tttgcgtgtg cttgtctcgc tcgctctgtg 360 

ttcttctgtc tctgcttttg tcgtgtccgc ttgtctcttg tcttcttcgg cttgcttttt 420 

ctcttctctc ctctccgttc gctgctgttt tctgttctcc gtgcctctgt cccttttgtc 480 

ttcttcgtct tcttcgcttc cttccgttcc ctccgctcct cttcctccgt tcctctcctc 540 

tcgtcgttcc tcccgctctc ccctttcgct gctgctctgg cttccctcgc tggccgtcct 600 

ccctcttcgc ctccctctcc ttccctctgt cgtttctcgt tgttgttctt gtgtgttgtt 660

gtgtgtgctc gtgttgttct ggttcttggt tggtggttgt gtggtgtgtg cgctgtgtgt 720 

gttgtttgtt gtgtttgtgc gctcgtggtg tactgctgag aagtcacacc accagcgtac 780 

gtcgtttaac cggctatcgt gggcgcatcc cccgagggtc tccgagccgg gcggtccggc 840

ggctgttccc gccttctttt ttttgctcct tgggccctgt ccaaacggtc ccgtacttgg 900

ccctcgaggg aactttggcg cccccttttt ttttttgcat gtccaccttc gcttttattg 960

agtagttagt gttctcaagc tggctcacac ccatgcggtc agctggggcc tagggtggct 1020 

ctttgcaaag ctgaggggca agctaaggaa gccaggcagg tcaggggccc tttcggcctt 1080

ctcaagcctc cacctgagtt ctcgtcaatg ccagtctccc tggtatgatt ggggacatta 1140 

tcagagaaac atctaatagc gcacatctgg gcacccacac tctgcttcag ttgcatccat 1200 

cctcccaccc caaattcaac tcctgaccca atacaaaaga cttttttaac caggatttct 1260 

tcttgcagga aagctgactt ggaaacacgg ggaggtggca gggaggggca aaaaggactc 1320 

tggcaagcag atccacttgt ctgggtccct gcagtgaaga acccaagatc caggtacctc 1380 

agcctggaag aaaccgtgca ctgcaggtct tcccttctat caaaactaaa agcaaggaac 1440 

agaagtcaat tgaaacccca ggtcatccta cccaccttct ggcagtctct ggataatgat 1500 

ctaaggaaat aagcttacat ttataattag aggataattt ggagaattta tactatttac 1560 

acggaccacc ctgcaaagat cagggaaggt aagagactgc tgggcagaga atggaaacag 1620 

gctggggggt tcctaatctc ttttcaccat tgcctttgtt gttcccaccc ttcagctcct 1680 

ggcagggaca gcagaggaat cagatactct tggggaacac aaggtcccag cagtggctag 1740 

ggtgtgactg ctacttttgc ataatcaggc agcagggagc tatgaggagc agaacagtcc 1800 

ctgcctgctg tcctcactca gaggcttggg gtgcactgag ccttggagtg cccgggatgg 1860

tggacagctt gagcacaggg gcaagggcag ggctctactg tagctcagga ataaactggg 1920 

accagttgat gttgtcaggg cccagtgggt cctcgtccag gccaggaaag ctgatgtcca 1980 

gcaggatctt gctgaggctg tcattcattg tgtccaggac caggccttct caaccccagc 2040

ctcgywccta gcg 2053



<210> 50 

<211> 2999 

<212> DNA 

<213> Homo sapien 

<400> 50 

agaagaggac aggaaagaaa cttcagtctg agggttgagg acacgtcccg gagcagagga 60 

ggaccaagct cccaagcccc ctgtcacctt gacacgcaga cggttcccca ggactgcgaa 120 

gaggggggca accagcgccc cgccccgccc cgctgcgccg cccgccattg gccggtccga 180

aagacgcggg gcgttgtcgg tttgaatgag agcgcctgga gccggggcgc aggcgcagag 240 

gagggcggtg ttgagaccgg cggagcggcg ggacccctag gtggcggagg gacgctccgg 300 

gaaagcgagg ggcgctacga gctctggccc acgtgacctg ccgggggcgg gagcaggggg 360 

cgcgccggcc tcctgcggtg cccctgcctt ggggaggggc cgtgaccacc cgtctgtcgc 420

ccgaggcggc cgccgctgca ccttcaccgc gtacccggga cccgcccgcc cgcgggagct 480

tataatatga ctcgatggag gaaaatttga taagcatgag agaagaccat tcttttcatg 540 

ttcgttacag aatggaagct tcttgcctag agctggcctt ggaaggggaa cgtctatgta 600 

aatcaggaga ctgccgcgct ggcgtgtcat tctttgaagc tgcagttcaa gttggaactg 660 

aagacctaaa aacacttagc gctatttaca gccagttggg caatgcttat ttctatttgc 720

atgattatgc caaagcatta gaatatcacc atcatgattt aacccttgca aggactattg 780 

gagaccagct gggggaagcg aaagctagtg gtaatctggg aaacacctta aaagttcttg 840

ggaattttga cgaagccata gtttgttgtc agcgacacct agatatttcc agagagctta 900 

atgacaaggt gggagaagca agagcacttt acaatcttgg gaatgtgtat catgccaaag 960 

ggaaaagttt tggttgccct ggtccccagg atgtaggaga atttccagaa gaagtgagag 1020 

atgctctgca ggcagccgtg gatttttatg aggaaaacct atcattagtg actgctttgg 1080 

gtgaccgagc ggcacaagga cgtgcctttg gaaatcttgg aaacacacat tacctccttg 1140

gcaacttcag ggatgcagtt atagctcatg agcagcgtct ccttattgca aaagaatttg 1200 

gagataaagc agctgaaaga agagcatata gcaaccttgg aaatgcatat atatttcttg 1260 

gtgaatttga aactgcctcg gaatactaca agaagacact actgttggcc cgacagctta 1320 

aagaccgagc tgtagaagca cagtcttgtt acagtcttgg aaatacatat actttacttc 1380 

aagactatga aaaggccatt gattatcatc tgaagcactt agcaattgct caagagctga 1440 

atgatagaat tggtgaagga agagcatgtt ggagcttagg aaatgcatac acagcactag 1500 

gaaatcatga tcaagcaatg cattttgctg aaaagcactt ggaaatttca agagaggttg 1560 

gggataaaag tggtgaacta acagcacgac ttaatctctc agaccttcaa atggttcttg 1620 

gtctgagcta cagcacaaat aactccataa tgtctgaaaa tactgaaatt gatagcagtt 1680 

tgaatggtgt acgccccaag ttgggacgcc ggcatagtat ggaaaatatg gaacttatga 1740

agttaacacc agaaaaggta cagaactgga acagtgaaat tcttgctaag caaaaacctc 1800 

ttattgccaa accttctgca aagctactct ttgtcaacag actgaagggg aaaaaataca 1860 

aaacgaattc ctccactaaa gttctccaag atgccagtaa ttctattgac caccgaattc 1920 

caaattctca gaggaaaatc agtgcagata ctattggaga tgaagggttc tttgacttat 1980 

taagccgatt tcaaagcaat aggatggatg atcagagatg ttgcttacaa gaaaagaact 2040 

gccatacagc ttcaacaaca acttcttcca ctccccctaa aatgatgcta aaaacatcat 2100 

ctgttcctgt ggtatccccc aacacggatg agtttttaga tcttcttgcc agctcacaga 2160 

gtcgccgtct ggatgaccag agggctagtt tcagtaattt gccagggctt cgtctaacac 2220 

aaaacagcca gtcggtactt agccacctga tgactaatga caacaaagag gctgatgaag 2280

atttctttga catccttgta aaatgtcaag gatccagatt agatgatcaa agatgtgctc 2340

caccacctgc taccacaaag ggtccgacag taccagatga agactttttc agccttattt 2400 

tacggtccca gggaaagaga atggatgaac agagagttct tttacaaaga gatcaaaaca 2460 

gagacactga ctttgggcta aaggactttt tgcaaaataa tgctttgttg gagtttaaaa 2520 

attcagggaa aaaatcggca gaccattagt tactatggat ttattttttt tcctttcaaa 2580 

cacggtaagg aaacaatcta ttactttttt ccttaaaagg agaatttata gcactgtaat 2640 

acagcttaaa atatttttag aatgatgtaa atagttaacc ttcagtagtc tattaaggca 2700 

ttaatacttc tctggacatg cgcgtttgag ggtggagggg tcctgtaagg tgcttcatcg 2760 

tctgtgatta ctgcttggga tgtgttcttt ggcagcttgt gagattactt tacctagtgt 2820 

ttataaagta ggaagttaag tgaatcatag attagaattt aatactctta tggaaataat 2880 

tttttaacat cttaattgac aatggcgttt ttttatacat aaccatggat gtagtgggaa 2940 

acaatgttgt ttggtaaaaa taatgtactt gatcaatgta aaaaagtata taaaatagt 2999 



<210> 51 

<211> 3045 

<212> DNA 

<213> Homo sapien 



<400> 51 

agaagaggac aggaaagaaa cttcagtctg agggttgagg acacgtcccg gagcagagga 60 

ggaccaagct cccaagcccc ctgtcacctt gacacgcaga cggttcccca ggactgcgaa 120 

gaggggggca accagcgccc cgccccgccc cgctgcgccg cccgccattg gccggtccga 180 

aagacgcggg gcgttgtcgg tttgaatgag agcgcctgga gccggggcgc aggcgcagag 240 

gagggcggtg ttgagaccgg cggagcggcg ggacccctag gtggcggagg gacgctccgg 300 

gaaagcgagg ggcgctacga gctctggccc acgtgacctg ccgggggcgg gagcaggggg 360 

cgcgccggcc tcctgcggtg cccctgcctt ggggaggggc cgtgaccacc cgtctgtcgc 420 

ccgaggcggc cgccgctgca ccttcaccgc gtacccggga cccgcccgcc cgcgggagaa 480 

cccacagatc tggaggacct actataattt aatttaattc taagcttata atatgactcg 540

atggaggaaa atttgataag catgagagaa gaccattctt ttcatgttcg ttacagaatg 600 

gaagcttctt gcctagagct ggccttggaa ggggaacgtc tatgtaaatc aggagactgc 660 

cgcgctggcg tgtcattctt tgaagctgca gttcaagttg gaactgaaga cctaaaaaca 720 

cttagcgcta tttacagcca gttgggcaat gcttatttct atttgcatga ttatgccaaa 780 

gcattagaat atcaccatca tgatttaacc cttgcaagga ctattggaga ccagctgggg 840

gaagcgaaag ctagtggtaa tctgggaaac accttaaaag ttcttgggaa ttttgacgaa 900

gccatagttt gttgtcagcg acacctagat atttccagag agcttaatga caaggtggga 960

gaagcaagag cactttacaa tcttgggaat gtgtatcatg ccaaagggaa aagttttggt 1020 

tgccctggtc cccaggatgt aggagaattt ccagaagaag tgagagatgc tctgcaggca 1080 

gccgtggatt tttatgagga aaacctatca ttagtgactg ctttgggtga ccgagcggca 1140 

caaggacgtg cctttggaaa tcttggaaac acacattacc tccttggcaa cttcagggat 1200 

gcagttatag ctcatgagca gcgtctcctt attgcaaaag aatttggaga taaagcagct 1260 

gaaagaagag catatagcaa ccttggaaat gcatatatat ttcttggtga atttgaaact 1320 

gcctcggaat actacaagaa gacactactg ttggcccgac agcttaaaga ccgagctgta 1380 

gaagcacagt cttgttacag tcttggaaat acatatactt tacttcaaga ctatgaaaag 1440 

gccattgatt atcatctgaa gcacttagca attgctcaag agctgaatga tagaattggt 1500 

gaaggaagag catgttggag cttaggaaat gcatacacag cactaggaaa tcatgatcaa 1560 

gcaatgcatt ttgctgaaaa gcacttggaa atttcaagag aggttgggga taaaagtggt 1620 

gaactaacag cacgacttaa tctctcagac cttcaaatgg ttcttggtct gagctacagc 1680 

acaaataact ccataatgtc tgaaaatact gaaattgata gcagtttgaa tggtgtacgc 1740

cccaagttgg gacgccggca tagtatggaa aatatggaac ttatgaagtt aacaccagaa 1800 

aaggtacaga actggaacag tgaaattctt gctaagcaaa aacctcttat tgccaaacct 1860 

tctgcaaagc tactctttgt caacagactg aaggggaaaa aatacaaaac gaattcctcc 1920

actaaagttc tccaagatgc cagtaattct attgaccacc gaattccaaa ttctcagagg 1980 

aaaatcagtg cagatactat tggagatgaa gggttctttg acttattaag ccgatttcaa 2040

agcaatagga tggatgatca gagatgttgc ttacaagaaa agaactgcca tacagcttca 2100 

acaacaactt cttccactcc ccctaaaatg atgctaaaaa catcatctgt tcctgtggta 2160 

tcccccaaca cggatgagtt tttagatctt cttgccagct cacagagtcg ccgtctggat 2220

gaccagaggg ctagtttcag taatttgcca gggcttcgtc taacacaaaa cagccagtcg 2280 

gtacttagcc acctgatgac taatgacaac aaagaggctg atgaagattt ctttgacatc 2340 

cttgtaaaat gtcaaggatc cagattagat gatcaaagat gtgctccacc acctgctacc 2400 

acaaagggtc cgacagtacc agatgaagac tttttcagcc ttattttacg gtcccaggga 2460 

aagagaatgg atgaacagag agttctttta caaagagatc aaaacagaga cactgacttt 2520 

gggctaaagg actttttgca aaataatgct ttgttggagt ttaaaaattc agggaaaaaa 2580 

tcggcagacc attagttact atggatttat tttttttcct ttcaaacacg gtaaggaaac 2640

aatctattac ttttttcctt aaaaggagaa tttatagcac tgtaatacag cttaaaatat 2700

ttttagaatg atgtaaatag ttaaccttca gtagtctatt aaggcattaa tacttctctg 2760

gacatgcgcg tttgagggtg gaggggtcct gtaaggtgct tcatcgtctg tgattactgc 2820

ttgggatgtg ttctttggca gcttgtgaga ttactttacc tagtgtttat aaagtaggaa 2880

gttaagtgaa tcatagatta gaatttaata ctcttatgga aataattttt taacatctta 2940

attgacaatg gcgttttttt atacataacc atggatgtag tgggaaacaa tgttgtttgg 3000

taaaaataat gtacttgatc aatgtaaaaa agtatataaa atagt 3045



<210> 52 

<211> 1650 

<212> DNA 

<213> Homo sapien 

<400> 52 



gaggccaaac atggggcggg gcggcgcggc cggggaagcg tgatgaaggc ctacgagtgc 60

ggcgcggcct gaaggggcac gcgggggacc tgcaaagcta gtgaggggcg gggcaggcgg 120

cgcggtgggg gcgggccgag cccggaggcc agatgagcgg acacagcccc acgcgcgggg 180

ccatgcaggt ggccatgaac ggtaaggccc gcaaagaggc ggtgcagact gcggctaagg 240

aactcctcaa gttcgtgaac cggagtccct ctcctttcca tgctgtggct gaatgccgca 300

accgccttct ccaggctggc ttcagtgaac tcaaggagac tgagaaatgg aatattaagc 360

ccgagagcaa gtacttcatg accaggaact cctccaccat catagctttt gctgtagggg 420

gccagtacgt tcctggcaat ggcttcagcc tcatcggggc ccacacggac agcccctgcc 480

tccgggtgaa acgtcggtct cgccgcagcc aggtgggctt ccagcaagtc ggtgtggaga 540

cctatggtgg tgggatctgg agcacctggt ttgaccgtga cctgactctg gctggacgcg 600

tcattgtcaa gtgccctacc tcaggtcggc tggagcagca gctggtgcac gtggagcggc 660

ccattcttcg catcccacac ctggccatcc atctgcagcg aaatatcaac gagaactttg 720

ggcccaacac agagatgcat ctagtcccca ttcttgccac agccatccag gaggagctgg 780

agaaggggac tcctgagcca gggcctctca atgctgtgga tgagcggcac cattcggtcc 840

tcatgtccct gctctgtgcc catctggggc tgagccccaa ggacatagtg gagatggagc 900

tctgccttgc agacacccag cctgcggtct tgggtggtgc ctatgatgag ttcatctttg 960

ctcctcggct ggacaatctg cacagctgct tctgtgccct gcaggccttg atagattcct 1020

gtgcaggccc tggctccctg gccacagagc ctcacgtgcg catggtcaca ctctatgaca 1080

acgaagaggt ggggtctgag agtgcacagg gagcacagtc actgctgaca gagctggtgc 1140

tgcggcggat ctcagcctcg tgccagcacc cgacagcctt cgaggaagcc atacccaagt 1200

ccttcatgat cagcgcagac atggcccatg ctgtgcatcc caactacctg gacaagcatg 1260

aggagaacca ccggccttta ttccacaagg gccccgtgat caaggtggca aaggccccaa 1320

cgttgtaggc cctacgggct actacaaccc ttcggtgacg ccataaaaac tcttcaccaa 1380 

agagccctta aacccgcaaa tctaccatca acctttaaat caacggccga cttagctttc 1440 

aacatcgtct tattattgaa acccccctcc atagcgaacc ccgggtcaac tacactaagg 1500 

tctatttatt gaggccctta cgagcgggca cccacccctg ccagggggga accgctgccg 1560 

tggccggcga cgccccacac tcttcgccca ttttgagtgc cacacgcgcc gcgcaaaagc 1620 

acaaacaaca aggggcacgg cgccgcatga 1650 

<210> 53 

<211> 1204 

<212> DNA 

<213> Homo sapien 

<400> 53 

ctctacagga tccccggtgt ctcgccacam agattmcatc ctcatgcatc taatcacaga 60

cgatgaagca ccttacagga cccctccacc ctcaaacgcg catgtccaga gaagtattaa 120 

tgccttaata gactactgaa ggttaactat ttacatcatt ctaaaaatat tttaagctgt 180 

attacagtgc tataaattct ccttttaagg aaaaaagtaa tagattgttt ccttaccgtg 240

tttgaaagga aaaaaaataa atccatagta actaatggtc tgccgatttt ttccctgaat 300

ttttaaactc caacaaagca ttattttgca aaaagtcctt tagcccaaag tcagtgtctc 360 

tgttttgatc tctttgtaaa agaactctct gttcatccat tctctttccc tgggaccgta 420 

aaataaggct gaaaaagtct tcatctggta ctgtcggacc ctttgtggta gcaggtggtg 480 

gagcacatct ttgatcatct aatctggatc cctagaagtt tatgagagag attaaaagac 540 

atactgtggg aaagaaagca ccaagctatg ttaaatctct cagtcacaac aagaaaacgc 600 

agttgtaaac ttttatttgg ggcttctcag aaattaatct ttgtttttat attgttctct 660 

ttagttccca gcctgatttc tgaaagaaat aatctttaaa cctgaaactg cataaaattt 720 

ttcctttaaa atggcttata taggtatagc tgcttcattg gtttagtacg tgcattatgt 780 

taagtcatac tgttttaatt ataattggtc acaaggataa aagacaaggt caactgtatc 840 

taagtatgac agttcaaggt tataatctaa caggcatcaa ctgtgtaact taacaaatct 900 

ttagaaatat gtacaaaccc catgaggaac taccttaaaa aaatcacgtc ttaggctagg 960 

ggacagcaaa ctgtaaccca gtgccggctt ctgtaaatga agtcattttc attttcccat 1020 

caatacatat gtttatatac tatctatatg gctgcttccc tactaaatgg cagagttggg 1080

cagttgcaat agataccata tgggcggcaa agcctaaaat attttctatc tggcctttta 1140 

cagaaaaagt ttgcctgcct ctgtcttagg ctgttaaatc tgttaagaaa tagcccagaa 1200

tgaa 1204

<210> 54 

<211> 265 

<212> DNA 

<213> Homo sapien 

<400> 54 

cactagtacg gcgcagtgtg cggaattcgg tttcgagcgg cgcccgggag tactgtgaat 60 

aatgatgaag gagagtggag tggcccacca cctgaatgca gaggaaaatc tctaacttcc 120

aaggtcccac caacagttca gaaacctacc acagtaaatg ttccaactac agaagtctca 180 

ccaacttctc agaaaaccac cacaaaaacc accacaccaa atgctcaagc aacacggagt 240 

acctcggccg cgaccacgct aagcc 265 

<210> 55 
<211> 2326 
<212> DNA 
<213> Homo sapien 

<400> 55 

ggcttagcgt ggtcgcggcc gaggtacagg gtgggagatg ggggaggggc tgggggctga 60 

agggagcaag aaagctgtag ctgtgtgggg ctggcaggat gttgaagacc gcctctgctg 120

ctctcacatg gggactgggc ccagatcctg cttggtcaca cccagcccag aagacgggtc 180 

ctccagttcc agtgactcac tgcagttatg gtgcccgcgg cttctggtgt tggggacctc 240

cgtgccgttg gggggaggag aagcaggagc tgtcgggaag atcagaagcc agtcatggat 300 

gaccagcgcg accttatctc caacaatgag caactgccca tgctgggccg gcgccctggg 360

gccccggaga gcaagtgcag ccgcggagcc ctgtacacag gcttttccat cctggtgact 420 

ctgctcctcg ctggccaggc caccaccgcc tacttcctgt accagcagca gggccggctg 480 

gacaaactga cagtcacctc ccagaacctg cagctggaga acctgcgcat gaagcttccc 540

aagcctccca agcctgtgag caagatgcgc atggccaccc cgctgctgat gcaggcgctg 600 

cccatgggag ccctgcccca ggggcccatg cagaatgcca ccaagtatgg caacatgaca 660 

gaggaccatg tgatgcacct gctccagaat gctgaccccc tgaaggtgta cccgccactg 720

aaggggagct tcccggagaa cctgagacac cttaagaaca ccatggagac catagactgg 780

aaggtctttg agagctggat gcaccattgg ctcctgtttg aaatgagcag gcactccttg 840 

gagcaaaagc ccactgacgc tccaccgaaa gtactgacca agtgccagga agaggtcagc 900 

cacatccctg ctgtccaccc gggttcattc aggcccaagt gcgacgagaa cggcaactat 960 

ctgccactcc agtgctatgg gagcatcggc tactgctggt gtgtcttccc caacggcacg 1020 

gaggtcccca acaccagaag ccgcgggcac cataactgca gtggtaagca gtggcactgt 1080

gccagtgtca gaggaccagg aaggactagg aaggttgagg ggcaagaggt cccctctgaa 1140

gcacatggga ccaggacacc caggatggca gctcctgggg gcagtgacgt agtcatgcgt 1200

ccagcccttt atccatccac ccacctgtgc attcctattt gtccatttat catctctcct 1260

cttacatgca ctcatatttg ttcacttatc cgtcagtcct ttcatctgtg aatttcatcc 1320

atccacgcgc catcctctag catccaggag tcctacagac ccacccatct catttcatca 1380

cccccttctc actcgagccc ccattacacc tctttgttgc tgcttgcagc tgtccttccc 1440

tgggtacctg ctttcccagg cactaagcct gtggctaggt gggaagcact gccctcaggc 1500

atcttgggtc aggtaggctg cacttcaagt gacaaacgga cttgctgctc ctttgcagag 1560

tcactggaac tggaggaccc gtcttctggg ctgggtgtga ccaagcagga tctgggccca 1620

gtccccatgt gagagcagca gaggcggtct tcaacatcct gccagcccca cacagctaca 1680

gctttcttgc tcccttcagc ccccagcccc tcccccatct cccaccctgt acctcatccc 1740

atgagaccct ggtgcctggc tctttcgtca cccttggaca agacaaacca agtcggaaca 1800

gcagataaca atgcagcaag gccctgctgc ccaatctcca tctgtcaaca ggggcgtgag 1860

gtcccaggaa gtggccaaaa gctagacaga tccccgttcc tgacatcaca gcagcctcca 1920

acacaaggct ccaagaccta ggctcatgga cgagatggga aggcacaggg agaagggata 1980

accctacacc cagaccccag gctggacatg ctgactgtcc tctcccctcc agcctttggc 2040

cttggctttt ctagcctatt tacctgcagg ctgagccact ctcttccctt tccccagcat 2100

cactccccaa ggaagagcca atgttttcca cccataatcc tttctgccga cccctagttc 2160

cctctgctca gccaagcttg ttatcagctt tcagggccat ggttcacatt agaataaaag 2220

gtagtaatag aaaaaaaaaa aaaacccggg ccttatggcg ggggggaaag gggacaggaa 2280

gggcatggct agggttcggc cggtcttgac aggggattgc gggcca 2326



<210> 56 

<211> 1001 

<212> DNA 

<213> Homo sapien 

<400> 56 



attctgcatg ggccctgggg cagggctssc atgggcagcg ctgcatcaga atgctgaccc 60

cctgaaggtg tacccgccac tgaaggggag cttcccggag aacctgagac accttaagaa 120

caccatggag accatagact ggaaggtctt tgagagctgg atgcaccatt ggctcctgtt 180

tgaaatgagc aggcactcct tggagcaaaa gcccactgac gctccaccga aagagtcact 240

ggaactggag gacccgtctt ctgggctggg tgtgaccaag caggatctgg gcccagtccc 300

catgtgagag cagcagaggc ggtcttcaac atcctgccag ccccacacag ctacagcttt 360



WO 2004/050858 



83/383 



PCT/US2003/038808 



cttgctccct tcagccccca gcccctcccc catctcccac cctgtacctc atcccatgag 420 

accctggtgc ctggctcttt cgtcaccctt ggacaagaca aaccaagtcg gaacagcaga 480 

taacaatgca gcaaggccct gctgcccaat ctccatctgt caacaggggc gtgaggtccc 540

aggaagtggc caaaagctag acagatcccc gttcctgaca tcacagcagc ctccaacaca 600 

aggctccaag acctaggctc atggacgaga tgggaaggca cagggagaag ggataaccct 660 

acacccagac cccaggctgg acatgctgac tgtcctctcc cctccagcct ttggccttgg 720 

cttttctagc ctatttacct gcaggctgag ccactctctt ccctttcccc agcatcactc 780 

cccaaggaag agccaatgtt ttccacccat aatcctttct gccgacccct agttccctct 840 

gctcagccaa gcttgttatc agctttcagg gccatggttc acattagaat aaaaggtagt 900 

aatagaaaaa aaaaaaaaac ccgggcctta tggcgggggg gaaaggggac aggaagggca 960

tggctagggt tcggccggtc ttgacagggg attgcgggcc a 1001



<210> 57 
<211> 1211 
<212> DNA 
<213> Homo sapien 

<400> 57 

caccagaagt gaggacggct gcctgggccc tgccgtgtgc tgctgggggg gctggctcca 60 

tggagggggt cctctgccct gctggctggt ggctgaggcc ccaggcctgt gcccagggcc 120 

gctggcagca cagtgaggcc acgtccacag cagagttggc cgtggagata gtgtacgtgg 180 

tgccctccgt agctgagagc ctcccagccc cacactggct gagtacccag ggcttccaga 240 

atttggaaaa aggagcagtg ttttggtcgt ggagctgtgg ggtgccgtgc gcgtccatgg 300 

gctgggtctg cgctcaggct gccctgccgt ggaagtgtgg gtgtgggtat gggttggggg 360 

tctgggaccc caggcagccc aagtgggagc agggccggcc ggtggggaag ggaggctctg 420 

gcctggtggg aagtgcagct ccgcgttgtc ccttctcttg cagcggggat ctgacgagct 480 

tttctctact tgtgtcacta acggaccgtt tatcatgagc agcaactcgg cttctgcagc 540

aaacggaaat gacagcaaga agttcaaagg tgacagccga agtgcaggcg tcccctctag 600

agtgatccac atccggaagc tccccatcga cgtcacggag ggggaagtca tctccctggg 660 

gctgcccttt gggaaggtca ccaacctcct gatgctgaag gggaaaaacc aggccttcat 720 

cgagatgaac acggaggagg ctgccaacac catggtgaac tactacacct cggtgacccc 780 

tgtgctgcgc ggccagccca tctacatcca gtttctctgt gctctttcta ccgcccccgc 840 

gtcctgtccc gggggctctc ctaggatccc ctttccgtaa aagcgtgtaa caagggtgta 900 

aatatttata attttttata cctgttgtga gacccgaggg gcggcggcgc ggttttttat 960

ggtgacacaa atgtatattt tgctaacagc aattccaggc tcagtattgt gaccgcggag 1020

ccacagggga ccccacgcac attccgttgc cttacccgat ggcttgtgac gcggagagaa 1080 

ccgattaaaa ccgtttgaga aactcctccc ttgtctagcc ctgtgttcgc tgtggacgct 1140 

gtagaggcag gttggccagt ctgtacctgg acttcgaata aatcttctgt atcctcgctc 1200 

cgttccgcct t 1211 

<210> 58 

<211> 5572 

<212> DNA 

<213> Homo sapien 

<400> 58 

caccgtggtg ttggcagcct cctccgtgtt catctcgatg aaggcctggt ttttcccctt 60 

cagcatcagg aggttggtga ccttcccaaa gggcagcccc agggagatga cttccccctc 120 

cgtgacgtcg atggggctat tccggcgcct ccactccgtc ccccgcgggt ctgctctgtg 180 

tgccatggac ggcattgtcc cagatatagc cgttggtaca aagcggggat ctgacgagct 240 

tttctctact tgtgtcacta acggaccgtt tatcatgagc agcaactcgg cttctgcagc 300 

aaacggaaat gacagcaaga agttcaaagg tgacagccga agtgcaggcg tcccctctag 360 

agtgatccac atccggaagc tccccatcga cgtcacggag ggggaagtca tctccctggg 420 

gctgcccttt gggaaggtca ccaacctcct gatgctgaag gggaaaaacc aggtacctga 480 

gccgcgtttc tccggggtgc tcacaccgtg caggcgggga cgaggagggc ccagcgctca 540 

ctgcctcccc aacaggcctt catcgagatg aacacggagg aggctgccaa caccatggtg 600 

aactactaca cctcggtgac ccctgtgctg cgcggccagc ccatctacat ccagttctcc 660 

aaccacaagg agctgaagac cgacagctct cccaaccagg cgcgggccca ggcggccctg 720 

caggcggtga actcggtcca gtcggggaac ctggccttgg ctgcctcggc ggcggccgtg 780 

gacgcaggga tggcgatggc cgggcagagc cccgtgctca ggatcatcgt ggagaacctc 840 

ttctaccctg tgaccctgga tgtgctgcac cagattttct ccaagttcgg cacagtgttg 900 

aagatcatca ccttcaccaa gaacaaccag ttccaggccc tgctgcagta tgcggacccc 960 

gtgagcgccc agcacgccaa gctgtcgctg gacgggcaga acatctacaa cgcctgctgc 1020 

acgctgcgca tcgacttttc caagctcacc agcctcaacg tcaagtacaa caatgacaag 1080 

agccgtgact acacacgccc agacctgcct tccggggaca gccagccctc gctggaccag 1140 

accatggccg cggccttcgc ctctccgtat gcaggagctg gtttccctcc cacctttgcc 1200 

attcctcaag ctgcagggta ttcaaacgct tggtcttggt tccccagcga ctgcatgccc 1260 

acaccacctt cccaggcagc tccgcatcca cggcggcagc ctgggcggac tgggcactcg 1320

agtgccaggt caggggccct tcccgggaga ggggacgcca cgtccacaga gcaggcttgg 1380

ccagggcagt ggtaggacag ggctgtggag gcccacgtgt ggacggcgcc agccagaagc 1440 

cgctaatcgc acagtctttg tggtctggtt cccttcaagc acctccgggc gagcgcgagg 1500 

accgccagtg gttggaggga tgtctctagt agttgaattt gagtggcctg gtaagcgcgc 1560 

ggcccgaggc tgagcagcat gaccggcggc gggacgtgtg tgcgtggccg tcctcccgct 1620 

tcccttctgg gaccggggcc cggccgcccc agtgctggcc gtggacggag cagcgcgtgc 1680 

cgcccggcgg ccgcctcgtc tgcatggagg catgggcgcg catgcgcggt ggtcccggga 1740 

cgggccctgc ttgccgaggg ccccgtgtct gtgctggcgg agccggagct tttctgtgct 1800 

gtgggcgggg cgcatgcaga tgagcccagg cccggcccgg cccgtgctgt gaggagaggc 1860 

gggcgctggt gcaggagccg gcgggtggct gtgcgggggt cgggcgaccc cacaggcacc 1920 

cagggtaggg ccagagccag ggccgcctcc cgcgcggctc ggctcctcgg tgcatgagga 1980 

cggggagcgt cggcctctcc cactctgcgg tggagtcggg ggcgccgccg ctcatctcac 2040 

ctcctgcttt tccaggcctt tccgttccga acgtccacgg cgccctggcc cccctggcca 2100 

tcccctcggc ggcggcggca gctgcggcgg caggtcggat cgccatcccg ggcctggcgg 2160

gggcaggaaa ttctgtattg ctggtcagca acctcaaccc agaggtacgt gggcttttcc 2220 

tccgcgccgc cgttcctccc ggaagagcga gcaggggatg ttgtgctcgg gctcgggtcc 2280 

cggagcagcg ccgcccctcg ctgtgtctgc gcctctgccc cggcagcccc tctgctgcag 2340 

cgctgagacc ctccttttcc aagatggctt gagttttcct cttttcctga attcacatct 2400 

tggttcgcgt tttctcttgc atgatacgca gaatgaatta ttttttggtt acccaaaggc 2460 

aatctaaaaa ttacttggag ctgagcaggg cgcgcagttc agagccccag caggccgccc 2520 

tgctggtcag cgggacacac agggttagcg cggcggggtg tgggcagccc ctgtggctcg 2580 

agagcctgtc acttaactgg tggatccgtt atttcccaga atggttaaat tcccctcggg 2640 

gagagcttgg gtgaaacaag aggcccaaaa tatgtaactc ttgatggatg ggatcctcct 2700 

ggaatcagca tttcaggctg ttctgtttcc cggcaactcc tgtgacttcc tcaaggcggt 2760 

gcccagcctg gccctccctc gcgggggccc aggacgcact ccctgcctgt gcctcttacc 2820 

acgagccggg tgctgtccct ccagctcctc ctgagcgagg ccgtgcacta acagactcga 2880 

tctcccttca cgcctgtaat cgtccttaga gactgacttg tgccacagct gcgggagaca 2940 

gcgctgctgc agggcttggc tgaggtggcc gcgagcggag ctgcacgcgc gctcctgggg 3000 

ggcgtggttg ggatcggctt ctgggagccc ctccagcagc gctgtctctg ccggcatgtg 3060 

agtgaaggtg cttctcttac cgtgtgcccg tcgaggtaaa cctgtaactg gaatgtgtgt 3120

ggagtgtgac tgatagaaca ctacctgatt cttatgtatt tactgacctg tgtttttttg 3180

ctactttttt tcttttctcc ccttcccctt tccctatttt ttttcttgcc ctgatccgga 3240

atttctttgc caactgactg cacggtactt ctgcttcctg ttgttgcttg aaacaaaaca 3300

aaaacataaa caaataaaaa acaaaaattc cccctcaaac cctgctctcc ggaaaccaac 3360

ctgcccttga atattaacat cctgacaact tcatcatcca tcaaccactg cacgcctgcg 3420

gggactgtct tcctcgtgtg gacgattggc aactcgcccc ccttgacctc tccctctccc 3480

ctgtccctcc gctgccttgc tctgctgtct ctaaagagag tcacacccca aagcctcttt 3540

attcttttcg gcgtctacgg tgacgtgcag cgcgtgaaga tcctgttcaa taagaaggag 3600

aacgccctag tgcagatggc ggacggcaac caggcccagc tggccatgag ccacctgaac 3660

gggcacaagc tgcacgggaa gcccatccgc atcacgctct cgaagcacca gaacgtgcag 3720

ctgccccgcg agggccagga ggaccagggc ctgaccaagg actacggcaa ctcacccctg 3780

caccgcttca agaagccggg ctccaagaac ttccagaaca tattcccgcc ctcggccacg 3840

ctgcacctct ccaacatccc gccctcagtc tccgaggagg atctcaaggt cctgttttcc 3900

agcaatgggg gcgtcgtcaa aggattcaag ttcttccaga aggaccgcaa gatggcactg 3960

atccagatgg gctccgtgga ggaggcggtc caggccctca ttgacctgca caaccacgac 4020

ctcggggaga accaccacct gcgggtctcc ttctccaagt ccaccatcta ggggcacagg 4080

cccccacggc cgggccccct ggcgacaact tccatcattc cagagaaaag ccactttaaa 4140

aacagctgaa gtgaccttag cagaccagag attttatttt tttaaagaga aatcagttta 4200

cctgttttta aaaaaattaa atctagttca ccttgctcac cctgcggtga cagggacagc 4260

tcaggctctt ggtgactgtg gcagcgggag ttcccggccc tccacacccg gggccagacc 4320

ctcggggcca tgccttggtg gggcctgtgt cgggcgtggg gcctgcaggt gggcgccccg 4380

accacgactt ggcttccttg tgccttaaaa aacctgcctt cctgcagcca cacacccacc 4440

cggggtgtcc tggggaccca aggggtgggg gggtcacacc agagagaggc agggggcctg 4500

gccggctcct gcaggatcat gcagctgggg cgcggcggcc gcggctgcga caccccaacc 4560

ccagccctct aatcaagtca cgtgattctc ccttcacccc gcccccaggg ccttcccttc 4620

tgcccccagg cgggctcccc gctgctccag ctgcggagct ggtcgacata atctctgtat 4680

tatatacttt gcagttgcag acgtctgtgc ctagcaatat ttccagttga ccaaatattc 4740

taatcttttt tcatttatat gcaaaagaaa tagttttaag taacttttta tagcaagatg 4800

atacaatggt atgagtgtaa tctaaacttc cttgtggtat taccttgtat gctgttactt 4860

ttattttatt ccttgtaatt aagtcacagg caggacccag tttccagaga gcaggcgggg 4920

ccgcccagtg ggtcaggcac agggagcccc ggtcctatct tagagcccct gagcttcagg 4980

gaaggggcgg gcgtgtcgcc gcctctggca tcgcctccgg ttgccttaca ccacgccttc 5040

acctgcagtc gcctagaaaa cttgctctca aacttcaggg ttttttcttc cttcaaattt 5100

tggaccaaag tctcatttct gtgttttgcc tgcctctgat gctgggaccc ggaaggcggg 5160

cgctcctcct gtcttctctg tgctctttct accgcccccg cgtcctgtcc cgggggctct 5220

cctaggatcc cctttccgta aaagcgtgta acaagggtgt aaatatttat aattttttat 5280

acctgttgtg agacccgagg ggcggcggcg cggtttttta tggtgacaca aatgtatatt 5340

ttgctaacag caattccagg ctcagtattg tgaccgcgga gccacagggg accccacgca 5400

cattccgttg ccttacccga tggcttgtga cgcggagaga accgattaaa accgtttgag 5460

aaactcctcc cttgtctagc cctgtgttcg ctgtggacgc tgtagaggca ggttggccag 5520

tctgtacctg gacttcgaat aaatcttctg tatcctcgct ccgttccgcc tt 5572


<210> 59 
<211> 6356 
<212> DNA
<213> Homo sapien

<400> 59

caccgtggtg ttggcagcct cctccgtgtt catctcgatg aaggcctggt ttttcccctt 60

cagcatcagg aggttggtga ccttcccaaa gggcagcccc agggagatga cttccccctc 120

cgtgacgtcg atggggctat tccggcgcct ccactccgtc ccccgcgggt ctgctctgtg 180

tgccatggac ggcattgtcc cagatatagc cgttggtaca aagcggggat ctgacgagct 240

tttctctact tgtgtcacta acggaccgtt tatcatgagc agcaactcgg cttctgcagc 300

aaacggaaat gacagcaaga agttcaaagg tgacagccga agtgcaggcg tcccctctag 360

agtgatccac atccggaagc tccccatcga cgtcacggag ggggaagtca tctccctggg 420

gctgcccttt gggaaggtca ccaacctcct gatgctgaag gggaaaaacc aggtacctga 480

gccgcgtttc tccggggtgc tcacaccgtg caggcgggga cgaggagggc ccagcgctca 540

ctgcctcccc aacaggcctt catcgagatg aacacggagg aggctgccaa caccatggtg 600

aactactaca cctcggtgac ccctgtgctg cgcggccagc ccatctacat ccagttctcc 660

aaccacaagg agctgaagac cgacagctct cccaaccagg cgcgggccca ggcggccctg 720

caggcggtga actcggtcca gtcggggaac ctggccttgg ctgcctcggc ggcggccgtg 780

gacgcaggga tggcgatggc cgggcagagc cccgtgctca ggatcatcgt ggagaacctc 840

ttctaccctg tgaccctgga tgtgctgcac cagattttct ccaagttcgg cacagtgttg 900

aagatcatca ccttcaccaa gaacaaccag ttccaggccc tgctgcagta tgcggacccc 960

gtgagcgccc agcacgccaa gctgtcgctg gacgggcaga acatctacaa cgcctgctgc 1020

acgctgcgca tcgacttttc caagctcacc agcctcaacg tcaagtacaa caatgacaag 1080

agccgtgact acacacgccc agacctgcct tccggggaca gccagccctc gctggaccag 1140

accatggccg cggccttcgg gtaagaggct gcccgacgcg gcgccagtgt gcagagtggt 1200

ctattagggc cgctcagtcc ggagccccgg cggcacgggc ccggccctgc accagggtga 1260 

tgcacctgct gctctctgca cggccagcac agcacggtcc agtgtccccc acgtcgggac 1320 

cacggccccc ctggagcagc ggatctgccc gcgaaggctc tgccggggcc gcccgccggc 1380 

cgggttgggg cccatcccgc agcacagcgc ccgctcgcgg tggaggttgt gggtgcgatg 1440

attagtgtct catttatttc taggtgcacc tggtataatc tcagcctctc cgtatgcagg 1500 

agctggtttc cctcccacct ttgccattcc tcaagctgca gggtattcaa acgcttggtc 1560 

ttggttcccc agcgactgca tgcccacacc accttcccag gcagctccgc atccacggcg 1620 

gcagcctggg cggactgggc actcgagtgc caggtcaggg gcccttcccg ggagagggga 1680

cgccacgtcc acagagcagg cttggccagg gcagtggtag gacagggctg tggaggccca 1740 

cgtgtggacg gcgccagcca gaagccgcta atcgcacagt ctttgtggtc tggttccctt 1800 

caagcacctc cgggcgagcg cgaggaccgc cagtggttgg agggatgtct ctagtagttg 1860 

aatttgagtg gcctggtaag cgcgcggccc gaggctgagc agcatgaccg gcggcgggac 1920 

gtgtgtgcgt ggccgtcctc ccgcttccct tctgggaccg gggcccggcc gccccagtgc 1980 

tggccgtgga cggagcagcg cgtgccgccc ggcggccgcc tcgtctgcat ggaggcatgg 2040

gcgcgcatgc gcggtggtcc cgggacgggc cctgcttgcc gagggccccg tgtctgtgct 2100 

ggcggagccg gagcttttct gtgctgtggg cggggcgcat gcagatgagc ccaggcccgg 2160 

cccggcccgt gctgtgagga gaggcgggcg ctggtgcagg agccggcggg tggctgtgcg 2220 

ggggtcgggc gaccccacag gcacccaggg tagggccaga gccagggccg cctcccgcgc 2280

ggctcggctc ctcggtgcat gaggacgggg agcgtcggcc tctcccactc tgcggtggag 2340

tcgggggcgc cgccgctcat ctcacctcct gcttttccag gcctttccgt tccgaacgtc 2400 

cacggcgccc tggcccccct ggccatcccc tcggcggcgg cggcagctgc ggcggcaggt 2460 

cggatcgcca tcccgggcct ggcgggggca ggaaattctg tattgctggt cagcaacctc 2520 

aacccagagg tacgtgggct tttcctccgc gccgccgttc ctcccggaag agcgagcagg 2580 

ggatgttgtg ctcgggctcg ggtcccggag cagcgccgcc cctcgctgtg tctgcgcctc 2640

tgccccggca gcccctctgc tgcagcgctg agaccctcct tttccaagat ggcttgagtt 2700 

ttcctctttt cctgaattca catcttggtt cgcgttttct cttgcatgat acgcagaatg 2760 

aattattttt tggttaccca aaggcaatct aaaaattact tggagctgag cagggcgcgc 2820 
agttcagagc cccagcaggc cgccctgctg gtcagcggga cacacagggt tagcgcggcg 2880

gggtgtgggc agcccctgtg gctcgagagc ctgtcactta actggtggat ccgttatttc 2940

ccagaatggt taaattcccc tcggggagag cttgggtgaa acaagaggcc caaaatatgt 3000

aactcttgat ggatgggatc ctcctggaat cagcatttca ggctgttctg tttcccggca 3060 

actcctgtga cttcctcaag gcggtgccca gcctggccct ccctcgcggg ggcccaggac 3120 

gcactccctg cctgtgcctc ttaccacgag ccgggtgctg tccctccagc tcctcctgag 3180 

cgaggccgtg cactaacaga ctcgatctcc cttcacgcct gtaatcgtcc ttagagactg 3240 

acttgtgcca cagctgcggg agacagcgct gctgcagggc ttggctgagg tggccgcgag 3300 

cggagctgca cgcgcgctcc tggggggcgt ggttgggatc ggcttctggg agcccctcca 3360 

gcagcgctgt ctctgccggc atgtgagtga aggtgcttct cttaccgtgt gcccgtcgag 3420

gtaaacctgt aactggaatg tgtgtggagt gtgactgata gaacactacc tgattcttat 3480 

gtatttactg acctgtgttt ttttgctact ttttttcttt tctccccttc ccctttccct 3540 

attttttttc ttgccctgat ccggaatttc tttgccaact gactgcacgg tacttctgct 3600 

tcctgttgtt gcttgaaaca aaacaaaaac ataaacaaat aaaaaacaaa aattccccct 3660 

caaaccctgc tctccggaaa ccaacctgcc cttgaatatt aacatcctga caacttcatc 3720 

atccatcaac cactgcacgc ctgcggggac tgtcttcctc gtgtggacga ttggcaactc 3780 

gccccccttg acctctccct ctcccctgtc cctccgctgc cttgctctgc tgtctctaaa 3840 

gagagtcaca ccccaaagcc tctttattct tttcggtatg ttatcgttca cacttttatt 3900 

accttgtttt cattaattga gatgattttg atgccctgag acgtattatg agtttgcggt 3960 

ttggcactct gatgctccgt ggcatccgcc tcgttttatg gtttgctttc ggtttgcgaa 4020 

ttttatttgg tcccgtagat acgtacgcat ggtttatcgc cctgcatgct tttcagagtt 4080 

agacctgtcg gtggcatatg ccaccgtggc cacccgctgg cagcttacct gtcctggatg 4140

ctatgacttt gctgaacgga gctgctcctg ttagcgcgcc ctgtggctgc gagacgcagc 4200 

tccgcagtgg ccgataaagc aaacccggcc gggctgagcc gggccttgtg ggggtgcgcg 4260 

gggccggggc tgacggggag atgggcgggg caggcagcag gagactcagg ccccatccct 4320 

gggcttttga aggcgcgtct acggtgacgt gcagcgcgtg aagatcctgt tcaataagaa 4380 

ggagaacgcc ctagtgcaga tggcggacgg caaccaggcc cagctggcca tgagccacct 4440 

gaacgggcac aagctgcacg ggaagcccat ccgcatcacg ctctcgaagc accagaacgt 4500 

gcagctgccc cgcgagggcc aggaggacca gggcctgacc aaggactacg gcaactcacc 4560 

cctgcaccgc ttcaagaagc cgggctccaa gaacttccag aacatattcc cgccctcggc 4620 

cacgctgcac ctctccaaca tcccgccctc agtctccgag gaggatctca aggtcctgtt 4680 
ttccagcaat gggggcgtcg tcaaaggatt caagttcttc cagaaggacc gcaagatggc 4740

actgatccag atgggctccg tggaggaggc ggtccaggcc ctcattgacc tgcacaacca 4800

cgacctcggg gagaaccacc acctgcgggt ctccttctcc aagtccacca tctaggggca 4860

caggccccca cggccgggcc ccctggcgac aacttccatc attccagaga aaagccactt 4920

taaaaacagc tgaagtgacc ttagcagacc agagatttta tttttttaaa gagaaatcag 4980 

tttacctgtt tttaaaaaaa ttaaatctag ttcaccttgc tcaccctgcg gtgacaggga 5040 

cagctcaggc tcttggtgac tgtggcagcg ggagttcccg gccctccaca cccggggcca 5100 

gaccctcggg gccatgcctt ggtggggcct gtgtcgggcg tggggcctgc aggtgggcgc 5160 

cccgaccacg acttggcttc cttgtgcctt aaaaaacctg ccttcctgca gccacacacc 5220

cacccggggt gtcctgggga cccaaggggt gggggggtca caccagagag aggcaggggg 5280

cctggccggc tcctgcagga tcatgcagct ggggcgcggc ggccgcggct gcgacacccc 5340

aaccccagcc ctctaatcaa gtcacgtgat tctcccttca ccccgccccc agggccttcc 5400 

cttctgcccc caggcgggct ccccgctgct ccagctgcgg agctggtcga cataatctct 5460 

gtattatata ctttgcagtt gcagacgtct gtgcctagca atatttccag ttgaccaaat 5520 

attctaatct tttttcattt atatgcaaaa gaaatagttt taagtaactt tttatagcaa 5580 

gatgatacaa tggtatgagt gtaatctaaa cttccttgtg gtattacctt gtatgctgtt 5640

acttttattt tattccttgt aattaagtca caggcaggac ccagtttcca gagagcaggc 5700 

ggggccgccc agtgggtcag gcacagggag ccccggtcct atcttagagc ccctgagctt 5760 

cagggaaggg gcgggcgtgt cgccgcctct ggcatcgcct ccggttgcct tacaccacgc 5820 

cttcacctgc agtcgcctag aaaacttgct ctcaaacttc agggtttttt cttccttcaa 5880 

attttggacc aaagtctcat ttctgtgttt tgcctgcctc tgatgctggg acccggaagg 5940 

cgggcgctcc tcctgtcttc tctgtgctct ttctaccgcc cccgcgtcct gtcccggggg 6000 

ctctcctagg atcccctttc cgtaaaagcg tgtaacaagg gtgtaaatat ttataatttt 6060 

ttatacctgt tgtgagaccc gaggggcggc ggcgcggttt tttatggtga cacaaatgta 6120

tattttgcta acagcaattc caggctcagt attgtgaccg cggagccaca ggggacccca 6180 

cgcacattcc gttgccttac ccgatggctt gtgacgcgga gagaaccgat taaaaccgtt 6240 

tgagaaactc ctcccttgtc tagccctgtg ttcgctgtgg acgctgtaga ggcaggttgg 6300 

ccagtctgta cctggacttc gaataaatct tctgtatcct cgctccgttc cgcctt 6356 



<210> 60 
<211> 3330 
<212> DNA 
<213> Homo sapien
<400> 60 

caccgtggtg ttggcagcct cctccgtgtt catctcgatg aaggcctggt ttttcccctt 60 

cagcatcagg aggttggtga ccttcccaaa gggcagcccc agggagatga cttccccctc 120 

cgtgacgtcg atggggctat tccggcgcct ccactccgtc ccccgcgggt ctgctctgtg 180 

tgccatggac ggcattgtcc cagatatagc cgttggtaca aagcggggat ctgacgagct 240 

tttctctact tgtgtcacta acggaccgtt tatcatgagc agcaactcgg cttctgcagc 300 

aaacggaaat gacagcaaga agttcaaagg tgacagccga agtgcaggcg tcccctctag 360 

agtgatccac atccggaagc tccccatcga cgtcacggag ggggaagtca tctccctggg 420 

gctgcccttt gggaaggtca ccaacctcct gatgctgaag gggaaaaacc aggtacctga 480

gccgcgtttc tccggggtgc tcacaccgtg caggcgggga cgaggagggc ccagcgctca 540 

ctgcctcccc aacaggcctt catcgagatg aacacggagg aggctgccaa caccatggtg 600 

aactactaca cctcggtgac ccctgtgctg cgcggccagc ccatctacat ccagttctcc 660 

aaccacaagg agctgaagac cgacagctct cccaaccagg cgcgggccca ggcggccctg 720 

caggcggtga actcggtcca gtcggggaac ctggccttgg ctgcctcggc ggcggccgtg 780 

gacgcaggga tggcgatggc cgggcagagc cccgtgctca ggatcatcgt ggagaacctc 840

ttctaccctg tgaccctgga tgtgctgcac cagattttct ccaagttcgg cacagtgttg 900 

aagatcatca ccttcaccaa gaacaaccag ttccaggccc tgctgcagta tgcggacccc 960 

gtgagcgccc agcacgccaa gctgtcgctg gacgggcaga acatctacaa cgcctgctgc 1020 

acgctgcgca tcgacttttc caagctcacc agcctcaacg tcaagtacaa caatgacaag 1080 

agccgtgact acacacgccc agacctgcct tccggggaca gccagccctc gctggaccag 1140

accatggccg cggccttcgg cctttccgtt ccgaacgtcc acggcgccct ggcccccctg 1200 

gccatcccct cggcggcggc ggcagctgcg gcggcaggtc ggatcgccat cccgggcctg 1260 

gcgggggcag gaaattctgt attgctggtc agcaacctca acccagaggc gtctacggtg 1320 

acgtgcagcg cgtgaagatc ctgttcaata agaaggagaa cgccctagtg cagatggcgg 1380 

acggcaacca ggcccagctg gccatgagcc acctgaacgg gcacaagctg cacgggaagc 1440 

ccatccgcat cacgctctcg aagcaccaga acgtgcagct gccccgcgag ggccaggagg 1500 

accagggcct gaccaaggac tacggcaact cacccctgca ccgcttcaag aagccgggct 1560 

ccaagaactt ccagaacata ttcccgccct cggccacgct gcacctctcc aacatcccgc 1620 

cctcagtctc cgaggaggat ctcaaggtcc tgttttccag caatgggggc gtcgtcaaag 1680 

gattcaagtt cttccagaag gaccgcaaga tggcactgat ccagatgggc tccgtggagg 1740

aggcggtcca ggccctcatt gacctgcaca accacgacct cggggagaac caccacctgc 1800

gggtctcctt ctccaagtcc accatctagg ggcacaggcc cccacggccg ggccccctgg 1860 

cgacaacttc catcattcca gagaaaagcc actttaaaaa cagctgaagt gaccttagca 1920 

gaccagagat tttatttttt taaagagaaa tcagtttacc tgtttttaaa aaaattaaat 1980 

ctagttcacc ttgctcaccc tgcggtgaca gggacagctc aggctcttgg tgactgtggc 2040 

agcgggagtt cccggccctc cacacccggg gccagaccct cggggccatg ccttggtggg 2100 

gcctgtgtcg ggcgtggggc ctgcaggtgg gcgccccgac cacgacttgg cttccttgtg 2160 

ccttaaaaaa cctgccttcc tgcagccaca cacccacccg gggtgtcctg gggacccaag 2220 

gggtgggggg gtcacaccag agagaggcag ggggcctggc cggctcctgc aggatcatgc 2280 

agctggggcg cggcggccgc ggctgcgaca ccccaacccc agccctctaa tcaagtcacg 2340 

tgattctccc ttcaccccgc ccccagggcc ttcccttctg cccccaggcg ggctccccgc 2400 

tgctccagct gcggagctgg tcgacataat ctctgtatta tatactttgc agttgcagac 2460

gtctgtgcct agcaatattt ccagttgacc aaatattcta atcttttttc atttatatgc 2520 

aaaagaaata gttttaagta actttttata gcaagatgat acaatggtat gagtgtaatc 2580 

taaacttcct tgtggtatta ccttgtatgc tgttactttt attttattcc ttgtaattaa 2640 

gtcacaggca ggacccagtt tccagagagc aggcggggcc gcccagtggg tcaggcacag 2700

ggagccccgg tcctatctta gagcccctga gcttcaggga aggggcgggc gtgtcgccgc 2760 

ctctggcatc gcctccggtt gccttacacc acgccttcac ctgcagtcgc ctagaaaact 2820 

tgctctcaaa cttcagggtt ttttcttcct tcaaattttg gaccaaagtc tcatttctgt 2880 

gttttgcctg cctctgatgc tgggacccgg aaggcgggcg ctcctcctgt cttctctgtg 2940 

ctctttctac cgcccccgcg tcctgtcccg ggggctctcc taggatcccc tttccgtaaa 3000

agcgtgtaac aagggtgtaa atatttataa ttttttatac ctgttgtgag acccgagggg 3060 

cggcggcgcg gttttttatg gtgacacaaa tgtatatttt gctaacagca attccaggct 3120 

cagtattgtg accgcggagc cacaggggac cccacgcaca ttccgttgcc ttacccgatg 3180 

gcttgtgacg cggagagaac cgattaaaac cgtttgagaa actcctccct tgtctagccc 3240 

tgtgttcgct gtggacgctg tagaggcagg ttggccagtc tgtacctgga cttcgaataa 3300 

atcttctgta tcctcgctcc gttccgcctt 3330 

<210> 61 

<211> 6229

<212> DNA 

<213> Homo sapien 

<400> 61 
caccgtggtg ttggcagcct cctccgtgtt catctcgatg aaggcctggt ttttcccctt 60

cagcatcagg aggttggtga ccttcccaaa gggcagcccc agggagatga cttccccctc 120 

cgtgacgtcg atggggctat tccggcgcct ccactccgtc ccccgcgggt ctgctctgtg 180 

tgccatggac ggcattgtcc cagatatagc cgttggtaca aagcggggat ctgacgagct 240 

tttctctact tgtgtcacta acggaccgtt tatcatgagc agcaactcgg cttctgcagc 300 

aaacggaaat gacagcaaga agttcaaagg tgacagccga agtgcaggcg tcccctctag 360 

agtgatccac atccggaagc tccccatcga cgtcacggag ggggaagtca tctccctggg 420 

gctgcccttt gggaaggtca ccaacctcct gatgctgaag gggaaaaacc aggtacctga 480 

gccgcgtttc tccggggtgc tcacaccgtg caggcgggga cgaggagggc ccagcgctca 540 

ctgcctcccc aacaggcctt catcgagatg aacacggagg aggctgccaa caccatggtg 600 

aactactaca cctcggtgac ccctgtgctg cgcggccagc ccatctacat ccagttctcc 660 

aaccacaagg agctgaagac cgacagctct cccaaccagg cgcgggccca ggcggccctg 720 

caggcggtga actcggtcca gtcggggaac ctggccttgg ctgcctcggc ggcggccgtg 780 

gacgcaggga tggcgatggc cgggcagagc cccgtgctca ggatcatcgt ggagaacctc 840

ttctaccctg tgaccctgga tgtgctgcac cagattttct ccaagttcgg cacagtgttg 900 

aagatcatca ccttcaccaa gaacaaccag ttccaggccc tgctgcagta tgcggacccc 960 

gtgagcgccc agcacgccaa gctgtcgctg gacgggcaga acatctacaa cgcctgctgc 1020

acgctgcgca tcgacttttc caagctcacc agcctcaacg tcaagtacaa caatgacaag 1080

agccgtgact acacacgccc agacctgcct tccggggaca gccagccctc gctggaccag 1140

accatggccg cggccttcgc ctctccgtat gcaggagctg gtttccctcc cacctttgcc 1200 

attcctcaag ctgcagggta ttcaaacgct tggtcttggt tccccagcga ctgcatgccc 1260 

acaccacctt cccaggcagc tccgcatcca cggcggcagc ctgggcggac tgggcactcg 1320

agtgccaggt caggggccct tcccgggaga ggggacgcca cgtccacaga gcaggcttgg 1380

ccagggcagt ggtaggacag ggctgtggag gcccacgtgt ggacggcgcc agccagaagc 1440

cgctaatcgc acagtctttg tggtctggtt cccttcaagc acctccgggc gagcgcgagg 1500 

accgccagtg gttggaggga tgtctctagt agttgaattt gagtggcctg gtaagcgcgc 1560 

ggcccgaggc tgagcagcat gaccggcggc gggacgtgtg tgcgtggccg tcctcccgct 1620 

tcccttctgg gaccggggcc cggccgcccc agtgctggcc gtggacggag cagcgcgtgc 1680 

cgcccggcgg ccgcctcgtc tgcatggagg catgggcgcg catgcgcggt ggtcccggga 1740

cgggccctgc ttgccgaggg ccccgtgtct gtgctggcgg agccggagct tttctgtgct 1800 

gtgggcgggg cgcatgcaga tgagcccagg cccggcccgg cccgtgctgt gaggagaggc 1860 
gggcgctggt gcaggagccg gcgggtggct gtgcgggggt cgggcgaccc cacaggcacc 1920

cagggtaggg ccagagccag ggccgcctcc cgcgcggctc ggctcctcgg tgcatgagga 1980

cggggagcgt cggcctctcc cactctgcgg tggagtcggg ggcgccgccg ctcatctcac 2040

ctcctgcttt tccaggcctt tccgttccga acgtccacgg cgccctggcc cccctggcca 2100

tcccctcggc ggcggcggca gctgcggcgg caggtcggat cgccatcccg ggcctggcgg 2160

gggcaggaaa ttctgtattg ctggtcagca acctcaaccc agaggtacgt gggcttttcc 2220

tccgcgccgc cgttcctccc ggaagagcga gcaggggatg ttgtgctcgg gctcgggtcc 2280

cggagcagcg ccgcccctcg ctgtgtctgc gcctctgccc cggcagcccc tctgctgcag 2340

cgctgagacc ctccttttcc aagatggctt gagttttcct cttttcctga attcacatct 2400

tggttcgcgt tttctcttgc atgatacgca gaatgaatta ttttttggtt acccaaaggc 2460

aatctaaaaa ttacttggag ctgagcaggg cgcgcagttc agagccccag caggccgccc 2520

tgctggtcag cgggacacac agggttagcg cggcggggtg tgggcagccc ctgtggctcg 2580

agagcctgtc acttaactgg tggatccgtt atttcccaga atggttaaat tcccctcggg 2640

gagagcttgg gtgaaacaag aggcccaaaa tatgtaactc ttgatggatg ggatcctcct 2700

ggaatcagca tttcaggctg ttctgtttcc cggcaactcc tgtgacttcc tcaaggcggt 2760

gcccagcctg gccctccctc gcgggggccc aggacgcact ccctgcctgt gcctcttacc 2820

acgagccggg tgctgtccct ccagctcctc ctgagcgagg ccgtgcacta acagactcga 2880

tctcccttca cgcctgtaat cgtccttaga gactgacttg tgccacagct gcgggagaca 2940

gcgctgctgc agggcttggc tgaggtggcc gcgagcggag ctgcacgcgc gctcctgggg 3000

ggcgtggttg ggatcggctt ctgggagccc ctccagcagc gctgtctctg ccggcatgtg 3060

agtgaaggtg cttctcttac cgtgtgcccg tcgaggtaaa cctgtaactg gaatgtgtgt 3120

ggagtgtgac tgatagaaca ctacctgatt cttatgtatt tactgacctg tgtttttttg 3180

ctactttttt tcttttctcc ccttcccctt tccctatttt ttttcttgcc ctgatccgga 3240

atttctttgc caactgactg cacggtactt ctgcttcctg ttgttgcttg aaacaaaaca 3300

aaaacataaa caaataaaaa acaaaaattc cccctcaaac cctgctctcc ggaaaccaac 3360

ctgcccttga atattaacat cctgacaact tcatcatcca tcaaccactg cacgcctgcg 3420

gggactgtct tcctcgtgtg gacgattggc aactcgcccc ccttgacctc tccctctccc 3480

ctgtccctcc gctgccttgc tctgctgtct ctaaagagag tcacacccca aagcctcttt 3540

attcttttcg gtatgttatc gttcacactt ttattacctt gttttcatta attgagatga 3600

ttttgatgcc ctgagacgta ttatgagttt gcggtttggc actctgatgc tccgtggcat 3660






ccgcctcgtt ttatggtttg ctttcggttt gcgaatttta tttggtcccg tagatacgta 3720

cgcatggttt atcgccctgc atgcttttca gagttagacc tgtcggtggc atatgccacc 3780

gtggccaccc gctggcagct tacctgtcct ggatgctatg actttgctga acggagctgc 3840

tcctgttagc gcgccctgtg gctgcgagac gcagctccgc agtggccgat aaagcaaacc 3900

cggccgggct gagccgggcc ttgtgggggt gcgcggggcc ggggctgacg gggagatggg 3960

cggggcaggc agcaggagac tcaggcccca tccctgggct tttgaaggcg cgtctacggt 4020

gacgtgcagc gcgtgaagat cctgttcaat aagaaggaga acgccctagt gcagatggcg 4080

gacggcaacc aggcccagct ggccatgagc cacctgaacg ggcacaagct gcacgggaag 4140

cccatccgca tcacgctctc gaagcaccag aacgtgcagc tgccccgcga gggccaggag 4200

gaccagggcc tgaccaagga ctacggcaac tcacccctgc accgcttcaa gaagccgggc 4260

tccaagaact tccagaacat attcccgccc tcggccacgc tgcacctctc caacatcccg 4320

ccctcagtct ccgaggagga tctcaaggtc ctgttttcca gcaatggggg cgtcgtcaaa 4380

ggattcaagt tcttccagaa ggaccgcaag atggcactga tccagatggg ctccgtggag 4440

gaggcggtcc aggccctcat tgacctgcac aaccacgacc tcggggagaa ccaccacctg 4500

cgggtctcct tctccaagtc caccatctag gggcacaggc ccccacggcc gggccccctg 4560

gcgacaactt ccatcattcc agagaaaagc cactttaaaa acagctgaag tgaccttagc 4620

agaccagaga ttttattttt ttaaagagaa atcagtttac ctgtttttaa aaaaattaaa 4680

tctagttcac cttgctcacc ctgcggtgac agggacagct caggctcttg gtgactgtgg 4740

cagcgggagt tcccggccct ccacacccgg ggccagaccc tcggggccat gccttggtgg 4800

ggcctgtgtc gggcgtgggg cctgcaggtg ggcgccccga ccacgacttg gcttccttgt 4860

gccttaaaaa acctgccttc ctgcagccac acacccaccc ggggtgtcct ggggacccaa 4920

ggggtggggg ggtcacacca gagagaggca gggggcctgg ccggctcctg caggatcatg 4980

cagctggggc gcggcggccg cggctgcgac accccaaccc cagccctcta atcaagtcac 5040

gtgattctcc cttcaccccg cccccagggc cttcccttct gcccccaggc gggctccccg 5100

ctgctccagc tgcggagctg gtcgacataa tctctgtatt atatactttg cagttgcaga 5160

cgtctgtgcc tagcaatatt tccagttgac caaatattct aatctttttt catttatatg 5220

caaaagaaat agttttaagt aactttttat agcaagatga tacaatggta tgagtgtaat 5280

ctaaacttcc ttgtggtatt accttgtatg ctgttacttt tattttattc cttgtaatta 5340

agtcacaggc aggacccagt ttccagagag caggcggggc cgcccagtgg gtcaggcaca 5400

gggagccccg gtcctatctt agagcccctg agcttcaggg aaggggcggg cgtgtcgccg 5460

cctctggcat cgcctccggt tgccttacac cacgccttca cctgcagtcg cctagaaaac 5520

ttgctctcaa acttcagggt tttttcttcc ttcaaatttt ggaccaaagt ctcatttccg 5580

tgttttgcct gcctctgatg ctgggacccg gaaggcgggc gctcctcctg tcttctctgt 5640

gctctttcta ccgcccccgc gtcctgtccc gggggctctc ctaggatccc ctttccgtaa 5700

aagcgtgtaa caagggtgta aatatttata attttttata cctgttgtga gacccgaggg 5760

gcggcggcgc ggttttttat ggtgacacaa atgtatattt tgctaacagc aattccaggc 5820

tcagtattgt gaccgcggag ccacagggga ccccacgcac atcccgttgc cttacccgat 5880

ggcttgtgac gcggagagaa ccgattaaaa ccgtttgaga aactcctccc ctgtctagcc 5940

ctgtgttcgc tgtggacgct gtagaggcag gttggccagt ctgtacctgg acttcgaata 6000

aatcttctgt atcctcgctc cgttccgcct tcgttgctgg cttcctcttt ctgtgccgcg 6060

ggcggccccg actcccgccc gggtgtccgc cgaagcgcca ggtggggcca cgggcggggt 6120

gggcccggcc ctgagagtgg gggtccctgc gtggggcgtg cgggccagag cgaatgacca 6180

agagccggac ctcggtttct gccgctttat tgagcatccg cgcggccgc 6229


<210> 62 

<211> 5745 

<212> DNA 

<213> Homo sapien 












<400> 62

caccgtggtg ttggcagcct cctccgtgtt catctcgatg aaggcctggt ttttcccctt 60

cagcatcagg aggttggtga ccttcccaaa gggcagcccc agggagatga cttccccctc 120

cgtgacgtcg atggggctat tccggcgcct ccactccgtc ccccgcgggt ctgctctgtg 180

tgccatggac ggcattgtcc cagatatagc cgttggtaca aagcggggat ctgacgagct 240

tttctctact tgtgtcacta acggaccgtt tatcatgagc agcaactcgg cttctgcagc 300

aaacggaaat gacagcaaga agttcaaagg tgacagccga agtgcaggcg tcccctctag 360

agtgatccac atccggaagc tccccatcga cgtcacggag ggggaagtca tctccctggg 420

gctgcccttt gggaaggtca ccaacctcct gatgctgaag gggaaaaacc aggtacctga 480

gccgcgtttc tccggggtgc tcacaccgtg caggcgggga cgaggagggc ccagcgctca 540

ctgcctcccc aacaggcctt catcgagatg aacacggagg aggctgccaa caccatggtg 600

aactactaca cctcggtgac ccctgtgctg cgcggccagc ccatctacat ccagttctcc 660

aaccacaagg agctgaagac cgacagctct cccaaccagg cgcgggccca ggcggccctg 720

caggcggtga actcggtcca gtcggggaac ctggccttgg ctgcctcggc ggcggccgtg 780

gacgcaggga tggcgatggc cgggcagagc cccgtgctca ggatcatcgt ggagaacctc 840

ttctaccctg tgaccctgga tgtgctgcac cagattttct ccaagttcgg cacagtgttg 900

aagatcatca ccttcaccaa gaacaaccag ttccaggccc tgctgcagta tgcggacccc 960

gtgagcgccc agcacgccaa gctgtcgctg gacgggcaga acatctacaa cgcctgctgc 1020

acgctgcgca tcgacttttc caagctcacc agcctcaacg tcaagtacaa caatgacaag 1080

agccgtgact acacacgccc agacctgcct tccggggaca gccagccctc gctggaccag 1140

accatggccg cggccttcgc ctctccgtat gcaggagctg gtttccctcc cacctttgcc 1200 

attcctcaag ctgcagggta ttcaaacgct tggtcttggt tccccagcga ctgcatgccc 1260 

acaccacctt cccaggcagc tccgcatcca cggcggcagc ctgggcggac tgggcactcg 1320

agtgccaggt caggggccct tcccgggaga ggggacgcca cgtccacaga gcaggcttgg 1380 

ccagggcagt ggtaggacag ggctgtggag gcccacgtgt ggacggcgcc agccagaagc 1440 

cgctaatcgc acagtctttg tggtctggtt cccttcaagc acctccgggc gagcgcgagg 1500 

accgccagtg gttggaggga tgtctctagt agttgaattt gagtggcctg gtaagcgcgc 1560 

ggcccgaggc tgagcagcat gaccggcggc gggacgtgtg tgcgtggccg tcctcccgct 1620 

tcccttctgg gaccggggcc cggccgcccc agtgctggcc gtggacggag cagcgcgtgc 1680 

cgcccggcgg ccgcctcgtc tgcatggagg catgggcgcg catgcgcggt ggtcccggga 1740 

cgggccctgc ttgccgaggg ccccgtgtct gtgctggcgg agccggagct tttctgtgct 1800

gtgggcgggg cgcatgcaga tgagcccagg cccggcccgg cccgtgctgt gaggagaggc 1860 

gggcgctggt gcaggagccg gcgggtggct gtgcgggggt cgggcgaccc cacaggcacc 1920 

cagggtaggg ccagagccag ggccgcctcc cgcgcggctc ggctcctcgg tgcatgagga 1980

cggggagcgt cggcctctcc cactctgcgg tggagtcggg ggcgccgccg ctcatctcac 2040 

ctcctgcttt tccaggcctt tccgttccga acgtccacgg cgccctggcc cccctggcca 2100 

tcccctcggc ggcggcggca gctgcggcgg caggtcggat cgccatcccg ggcctggcgg 2160 

gggcaggaaa ttctgtattg ctggtcagca acctcaaccc agaggtacgt gggcttttcc 2220 

tccgcgccgc cgttcctccc ggaagagcga gcaggggatg ttgtgctcgg gctcgggtcc 2280 

cggagcagcg ccgcccctcg ctgtgtctgc gcctctgccc cggcagcccc tctgctgcag 2340 

cgctgagacc ctccttttcc aagatggctt gagttttcct cttttcctga attcacatct 2400 

tggttcgcgt tttctcttgc atgatacgca gaatgaatta ttttttggtt acccaaaggc 2460 

aatctaaaaa ttacttggag ctgagcaggg cgcgcagttc agagccccag caggccgccc 2520 

tgctggtcag cgggacacac agggttagcg cggcggggtg tgggcagccc ctgtggctcg 2580 

agagcctgtc acttaactgg tggatccgtt atttcccaga atggttaaat tcccctcggg 2640

gagagcttgg gtgaaacaag aggcccaaaa tatgtaactc ttgatggatg ggatcctcct 2700 
ggaatcagca tttcaggctg ttctgtttcc cggcaactcc tgtgacttcc tcaaggcggt 2760

gcccagcctg gccctccctc gcgggggccc aggacgcact ccctgcctgt gcctcttacc 2820

acgagccggg tgctgtccct ccagctcctc ctgagcgagg ccgtgcacta acagactcga 2880

tctcccttca cgcctgtaat cgtccttaga gactgacttg tgccacagct gcgggagaca 2940

gcgctgctgc agggcttggc tgaggtggcc gcgagcggag ctgcacgcgc gctcctgggg 3000

ggcgtggttg ggatcggctt ctgggagccc ctccagcagc gctgtctctg ccggcatgtg 3060

agtgaaggtg cttctcttac cgtgtgcccg tcgaggtaaa cctgtaactg gaatgtgtgt 3120

ggagtgtgac tgatagaaca ctacctgatt cttatgtatt tactgacctg tgtttttttg 3180

ctactttttt tcttttctcc ccttcccctt tccctatttt ttttcttgcc ctgatccgga 3240

atttctttgc caactgactg cacggtactt ctgcttcctg ttgttgcttg aaacaaaaca 3300

aaaacataaa caaataaaaa acaaaaattc cccctcaaac cctgctctcc ggaaaccaac 3360

ctgcccttga atattaacat cctgacaact tcatcatcca tcaaccactg cacgcctgcg 3420

gggactgtct tcctcgtgtg gacgattggc aactcgcccc ccttgacctc tccctctccc 3480

ctgtccctcc gctgccttgc tctgctgtct ctaaagagag tcacacccca aagcctcttt 3540

attcttttcg gtatgttatc gttcacactt ttattacctt gttttcatta attgagatga 3600

ttttgatgcc ctgagacgta ttatgagttt gcggtttggc actctgatgc tccgtggcat 3660

ccgcctcgtt ttatggtttg ctttcggttt gcgaatttta tttggtcccg tagatacgta 3720

cgcatggttt atcgccctgc atgcttttca gagttagacc tgtcggtggc atatgccacc 3780

gtggccaccc gctggcagct tacctgtcct ggatgctatg actttgctga acggagctgc 3840

tcctgttagc gcgccctgtg gctgcgagac gcagctccgc agtggccgat aaagcaaacc 3900

cggccgggct gagccgggcc ttgtgggggt gcgcggggcc ggggctgacg gggagatggg 3960

cggggcaggc agcaggagac tcaggcccca tccctgggct tttgaaggcg cgtctacggt 4020

gacgtgcagc gcgtgaagat cctgttcaat aagaaggaga acgccctagt gcagatggcg 4080

gacggcaacc aggcccagct ggccatgagc cacctgaacg ggcacaagct gcacgggaag 4140

cccatccgca tcacgctctc gaagcaccag aacgtgcagc tgccccgcga gggccaggag 4200

gaccagggcc tgaccaagga ctacggcaac tcacccctgc accgcttcaa gaagccgggc 4260

tccaagaact tccagaacat attcccgccc tcggccacgc tgcacctctc caacatcccg 4320

ccctcagtct ccgaggagga tctcaaggtc ctgttttcca gcaatggggg cgtcgtcaaa 4380

ggattcaagt tcttccagaa ggaccgcaag atggcactga tccagatggg ctccgtggag 4440

gaggcggtcc aggccctcat tgacctgcac aaccacgacc tcggggagaa ccaccacctg 4500

cgggtctcct tctccaagtc caccatctag gggcacaggc ccccacggcc gggccccctg 4560

gcgacaactt ccatcattcc agagaaaagc cactttaaaa acagctgaag tgaccttagc 4620

agaccagaga ttttattttt ttaaagagaa atcagtttac ctgtttttaa aaaaattaaa 4680 

tctagttcac cttgctcacc ctgcggtgac agggacagct caggctcttg gtgactgtgg 4740 

cagcgggagt tcccggccct ccacacccgg ggccagaccc tcggggccat gccttggtgg 4800 

ggcctgtgtc gggcgtgggg cctgcaggtg ggcgccccga ccacgacttg gcttccttgt 4860 

gccttaaaaa acctgccttc ctgcagccac acacccaccc ggggtgtcct ggggacccaa 4920 

ggggtggggg ggtcacacca gagagaggca gggggcctgg ccggctcctg caggatcatg 4980 

cagctggggc gcggcggccg cggctgcgac accccaaccc cagccctcta atcaagtcac 5040

gtgattctcc cttcaccccg cccccagggc cttcccttct gcccccaggc gggctccccg 5100 

ctgctccagc tgcggagctg gtcgacataa tctctgtatt atatactttg cagttgcaga 5160 

cgtctgtgcc tagcaatatt tccagttgac caaatattct aatctttttt catttatatg 5220 

caaaagaaat agttttaagt aactttttat agcaagatga tacaatggta tgagtgtaat 5280 

ctaaacttcc ttgtggtatt accttgtatg ctgttacttt tattttattc cttgtaatta 5340 

agtcacaggc aggacccagt ttccagagag caggcggggc cgcccagtgg gtcaggcaca 5400

gggagccccg gtcctatctt agagcccctg agcttcaggg aaggggcggg cgttctctgt 5460 

gctctttcta ccgcccccgc gtcctgtccc gggggctctc ctaggatccc ctttccgtaa 5520 

aagcgtgtaa caagggtgta aatatttata attttttata cctgttgtga gacccgaggg 5580 

gcggcggcgc ggttttttat ggtgacacaa atgtatattt tgctaacagc aattccaggc 5640 

tcagtattgt gaccgcggag ccacagggga ccccacgcac attccgttgc cttacccgat 5700 

ggcttgtgac gcggagagaa ccgattaaaa ccgtttgaga aactc 5745 

<210> 63 

<211> 5547 

<212> DNA 

<213> Homo sapien 

<400> 63 

caccgtggtg ttggcagcct cctccgtgtt catctcgatg aaggcctggt ttttcccctt 60 

cagcatcagg aggttggtga ccttcccaaa gggcagcccc agggagatga cttccccctc 120 

cgtgacgtcg atggggctat tccggcgcct ccactccgtc ccccgcgggt ctgctctgtg 180 

tgccatggac ggcattgtcc cagatatagc cgttggtaca aagcggggat ctgacgagct 240 

tttctctact tgtgtcacta acggaccgtt tatcatgagc agcaactcgg cttctgcagc 300 

aaacggaaat gacagcaaga agttcaaagg tgacagccga agtgcaggcg tcccctctag 360 

agtgatccac atccggaagc tccccatcga cgtcacggag ggggaagtca tctccctggg 420 
gctgcccttt gggaaggtca ccaacctcct gatgctgaag gggaaaaacc aggtacctga 480

gccgcgtttc tccggggtgc tcacaccgtg caggcgggga cgaggagggc ccagcgctca 540 

ctgcctcccc aacaggcctt catcgagatg aacacggagg aggctgccaa caccatggtg 600 

aactactaca cctcggtgac ccctgtgctg cgcggccagc ccatctacat ccagttctcc 660 

aaccacaagg agctgaagac cgacagctct cccaaccagg cgcgggccca ggcggccctg 720 

caggcggtga actcggtcca gtcggggaac ctggccttgg ctgcctcggc ggcggccgtg 780 

gacgcaggga tggcgatggc cgggcagagc cccgtgctca ggatcatcgt ggagaacctc 840 

ttctaccctg tgaccctgga tgtgctgcac cagattttct ccaagttcgg cacagtgttg 900 

aagatcatca ccttcaccaa gaacaaccag ttccaggccc tgctgcagta tgcggacccc 960 

gtgagcgccc agcacgccaa gctgtcgctg gacgggcaga acatctacaa cgcctgctgc 1020 

acgctgcgca tcgacttttc caagctcacc agcctcaacg tcaagtacaa caatgacaag 1080 

agccgtgact acacacgccc agacctgcct tccggggaca gccagccctc gctggaccag 1140 

accatggccg cggccttcgc ctctccgtat gcaggagctg gtttccctcc cacctttgcc 1200 

attcctcaag ctgcagggta ttcaaacgct tggtcttggt tccccagcga ctgcatgccc 1260 

acaccacctt cccaggcagc tccgcatcca cggcggcagc ctgggcggac tgggcactcg 1320 

agtgccaggt caggggccct tcccgggaga ggggacgcca cgtccacaga gcaggcttgg 1380 

ccagggcagt ggtaggacag ggctgtggag gcccacgtgt ggacggcgcc agccagaagc 1440 

cgctaatcgc acagtctttg tggtctggtt cccttcaagc acctccgggc gagcgcgagg 1500 

accgccagtg gttggaggga tgtctctagt agttgaattt gagtggcctg gtaagcgcgc 1560 

ggcccgaggc tgagcagcat gaccggcggc gggacgtgtg tgcgtggccg tcctcccgct 1620 

tcccttctgg gaccggggcc cggccgcccc agtgctggcc gtggacggag cagcgcgtgc 1680 

cgcccggcgg ccgcctcgtc tgcatggagg catgggcgcg catgcgcggt ggtcccggga 1740 

cgggccctgc ttgccgaggg ccccgtgtct gtgctggcgg agccggagct tttctgtgct 1800 

gtgggcgggg cgcatgcaga tgagcccagg cccggcccgg cccgtgctgt gaggagaggc 1860 

gggcgctggt gcaggagccg gcgggtggct gtgcgggggt cgggcgaccc cacaggcacc 1920 

cagggtaggg ccagagccag ggccgcctcc cgcgcggctc ggctcctcgg tgcatgagga 1980 

cggggagcgt cggcctctcc cactctgcgg tggagtcggg ggcgccgccg ctcatctcac 2040 

ctcctgcttt tccaggcctt tccgttccga acgtccacgg cgccctggcc cccctggcca 2100 

tcccctcggc ggcggcggca gctgcggcgg caggtcggat cgccatcccg ggcctggcgg 2160 

gggcaggaaa ttctgtattg ctggtcagca acctcaaccc agaggtacgt gggcttttcc 2220 
tccgcgccgc cgttcctccc ggaagagcga gcaggggatg ttgtgctcgg gctcgggtcc 2280

cggagcagcg ccgcccctcg ctgtgtctgc gcctctgccc cggcagcccc tctgctgcag 2340 

cgctgagacc ctccttttcc aagatggctt gagttttcct cttttcctga attcacatct 2400 

tggttcgcgt tttctcttgc atgatacgca gaatgaatta ttttttggtt acccaaaggc 2460 

aatctaaaaa ttacttggag ctgagcaggg cgcgcagttc agagccccag caggccgccc 2520 

tgctggtcag cgggacacac agggttagcg cggcggggtg tgggcagccc ctgtggctcg 2580 

agagcctgtc acttaactgg tggatccgtt atttcccaga atggttaaat tcccctcggg 2640 

gagagcttgg gtgaaacaag aggcccaaaa tatgtaactc ttgatggatg ggatcctcct 2700 

ggaatcagca tttcaggctg ttctgtttcc cggcaactcc tgtgacttcc tcaaggcggt 2760 

gcccagcctg gccctccctc gcgggggccc aggacgcact ccctgcctgt gcctcttacc 2820 

acgagccggg tgctgtccct ccagctcctc ctgagcgagg ccgtgcacta acagactcga 2880 

tctcccttca cgcctgtaat cgtccttaga gactgacttg tgccacagct gcgggagaca 2940 

gcgctgctgc agggcttggc tgaggtggcc gcgagcggag ctgcacgcgc gctcctgggg 3000 

ggcgtggttg ggatcggctt ctgggagccc ctccagcagc gctgtctctg ccggcatgtg 3060 

agtgaaggtg cttctcttac cgtgtgcccg tcgaggtaaa cctgtaactg gaatgtgtgt 3120 

ggagtgtgac tgatagaaca ctacctgatt cttatgtatt tactgacctg tgtttttttg 3180 

ctactttttt tcttttctcc ccttcccctt tccctatttt ttttcttgcc ctgatccgga 3240 

atttctttgc caactgactg cacggtactt ctgcttcctg ttgttgcttg aaacaaaaca 3300 

aaaacataaa caaataaaaa acaaaaattc cccctcaaac cctgctctcc ggaaaccaac 3360 

ctgcccttga atattaacat cctgacaact tcatcatcca tcaaccactg cacgcctgcg 3420 

gggactgtct tcctcgtgtg gacgattggc aactcgcccc ccttgacctc tccctctccc 3480 

ctgtccctcc gctgccttgc tctgctgtct ctaaagagag tcacacccca aagcctcttt 3540 

attcttttcg gtatgttatc gttcacactt ttattacctt gttttcatta attgagatga 3600 

ttttgatgcc ctgagacgta ttatgagttt gcggtttggc actctgatgc tccgtggcat 3660 

ccgcctcgtt ttatggtttg ctttcggttt gcgaatttta tttggtcccg tagatacgta 3720 

cgcatggttt atcgccctgc atgcttttca gagttagacc tgtcggtggc atatgccacc 3780 

gtggccaccc gctggcagct tacctgtcct ggatgctatg actttgctga acggagctgc 3840 

tcctgttagc gcgccctgtg gctgcgagac gcagctccgc agtggccgat aaagcaaacc 3900 

cggccgggct gagccgggcc ttgtgggggt gcgcggggcc ggggctgacg gggagatggg 3960 

cggggcaggc agcaggagac tcaggcccca tccctgggct tttgaaggcg cgtctacggt 4020 

gacgtgcagc gcgtgaagat cctgttcaat aagaaggaga acgccctagt gcagatggcg 4080 
gacggcaacc aggcccagct ggccatgagc cacctgaacg ggcacaagct gcacgggaag 4140

cccatccgca tcacgctctc gaagcaccag aacgtgcagc tgccccgcga gggccaggag 4200

gaccagggcc tgaccaagga ctacggcaac tcacccctgc accgcttcaa gaagccgggc 4260

tccaagaact tccagaacat attcccgccc tccccacggc cgggccccct ggcgacaact 4320

tccatcattc cagagaaaag ccactttaaa aacagctgaa gtgaccttag cagaccagag 4380

attttatttt tttaaagaga aatcagttta cctgttttta aaaaaattaa atctagttca 4440

ccttgctcac cctgcggtga cagggacagc tcagggccgg ctcctgcagg atcatgcagc 4500

tggggcgcgg cggccgcggc tgcgacaccc caaccccagc cctctaatca agtcacgtga 4560

ttctcccttc accccgcccc cagggccttc ccttctgccc ccaggcgggc tccccgctgc 4620

tccagctgcg gagctggtcg acataatctc tgtattatat actttgcagt tgcagacgtc 4680

tgtgcctagc aatatttcca gttgaccaaa tattctaatc ttttttcatt tatatgcaaa 4740

agaaatagtt ttaagtaact ttttatagca agatgataca atggtatgag tgtaatctaa 4800

acttccttgt ggtattacct tgtatgctgt tacttttatt ttattccttg taattaagtc 4860

acaggcagga cccagtttcc agagagcagg cggggccgcc cagtgggtca ggcacaggga 4920

gccccggtcc tatcttagag cccctgagct tcagggaagg ggcgggcgtg tcgccgcctc 4980

tggcatcgcc tccggttgcc ttacaccacg ccttcacctg cagtcgccta gaaaacttgc 5040

tctcaaactt cagggttttt tcttccttca aattttggac caaagtctca tttctgtgtt 5100

ttgcctgcct ctgatgctgg gacccggaag gcgggcgctc ctcctgtctt ctctgtgctc 5160

tttctaccgc ccccgcgtcc tgtcccgggg gctctcctag gatccccttt ccgtaaaagc 5220

gtgtaacaag ggtgtaaata tttataattt tttatacctg ttgtgagacc cgaggggcgg 5280

cggcgcggtt ttttatggtg acacaaatgt atattttgct aacagcaatt ccaggctcag 5340


tattgtgacc [remaining five blocks illegible in source] 5400


tgtgacgcgg agagaaccga ttaaaaccgt ttgagaaact cctcccttgt ctagccctgt 5460

gttcgctgtg gacgctgtag aggcaggttg gccagtctgt acctggactt cgaataaatc 5520

ttctgtatcc tcgctccgtt ccgcctt 5547


<210> 64 

<211> 944 

<212> DNA 

<213> Homo sapien 












<400> 64 

aacataggtc agtacaggcc gggttatcgt cgcaagaaag cctgagtcct gtcctttctc 60

tctccccgga cagcatgagc ttcaccactc gctccacctt ctccaccaac taccggtccc 120

tgggctctgt ccaggcgccc agctacggcg cccggccggt cagcagcgcg gccagcgtct 180

atgcaggcgc tgggggctct ggttcccgga tctccgtgtc ccgctccacc agcttcaggg 240

gcggcatggg gtccgggggc ctggccaccg ggatagccgg gggtctggca ggaatgggag 300 

gcatccagaa cgagaaggag accatgcaaa gcctgaacga ccgcctggcc tcttacctgg 360 

acagagtgag gagcctggag accgagaacc ggaggctgga gagcaaaatc cgggagcact 420 

tggagaagaa gggaccccag gtcagagact ggagccatta cttcaagatc atcgaggacc 480 

tgagggctca gatcttcgca aatactgtgg acaatgcccg catcgttctg cagattgaca 540 

atgcccgtct tgctgctgat gactttagag tcaagtatga gacagagctg gccatgcgcc 600 

agtctgtgga gaacgacatc catgggctcc gcaaggtcat tgatgacacc aatatcacac 660 

gactgcagct ggagacagag atcgaggctc tcaaggagga gctgctcttc atgaagaaga 720 

accacgaaga ggacsgtaca ggsctacaag mccmgatgsc mgstctgggt gmcgyggagg 780 

agatgccccc aacaccggga ctggcagacc aggcgactcg gcaagagacg agtggtcgag 840 

accgagggga gaaggtcgcc agaatgggga gccacggccc aagcggagtg tgaacgagcc 900 

ggtagagcgg cccgggagcg acgatagcgg gcacgggggc acag 944 

<210> 65 
<211> 1426 
<212> DNA 
<213> Homo sapien 

<400> 65 

gaaagcagcc tcgagggcca acaacacctg ctgtccgtgt ccatgcccgg ttggccaccc 60 

cgtttctggg ggcatgagct tcaccactcg ctccaccttc tccaccaact accggtccct 120 

gggctctgtc caggcgccca gctacggcgc ccggccggtc agcagcgcgg ccagcgtcta 180

tgcaggcgct gggggctctg gttcccggat ctccgtgtcc cgctccacca gcttcagggg 240 

cggcatgggg tccgggggcc tggccaccgg gatagccggg ggtctggcag gaatgggagg 300

catccagaac gagaaggaga ccatgcaaag cctgaacgac cgcctggcct cttacctgga 360 

cagagtgagg agcctggaga ccgagaaccg gaggctggag agcaaaatcc gggagcactt 420 

ggagaagaag ggaccccagg tcagagactg gagccattac ttcaagatca tcgaggacct 480 

gagggctcag atcttcgcaa atactgtgga caatgcccgc atcgttctgc agattgacaa 540 

tgcccgtctt gctgctgatg actttagagt caagtatgag acagagctgg ccatgcgcca 600 

gtctgtggag aacgacatcc atgggctccg caaggtcatt gatgacacca atatcacacg 660 

actgcagctg gagacagaga tcgaggctct caaggaggag ctgctcttca tgaagaagaa 720 

ccacgaagag gaagtaaaag gcctacaagc ccagattgcc agctctgggt tgaccgtgga 780 
ggtagatgcc cccaaatctc aggacctcgc caagatcatg gcagacatcc gggcccaata 840

tgacgagctg gctcggaaga accgagagga gctagacaag tactggtctc agcagattga 900

ggagagcacc acagtggtca ccacacagtc tgctgaggtt ggagctgctg agacgacgct 960

cacagagctg agacgtacag tccagtcctt ggagatcgac ctggactcca tgagaaatct 1020

gaaggccagc ttggagaaca gcctgaggga ggtggaggcc cgctacgccc tacagatgga 1080

gcagctcaac gggatcctgc tgcaccttga gtcagagctg gcacagaccc gggcagaggg 1140

acagcgccag gcccaggagt atgaggccct gctgaacatc aaggtcaagc tggaggctga 1200

gatcgccacc taccgccgcc tgctggaaga tggcgaggac tttaatcttg gtgatgcctt 1260

ggacagcagc aactccatgc aaaccatcca aaagaccacc acccgccgga tagtggatgg 1320

caaagtggtg tctgagacca atgacaccaa agttctgagg cattaagcca gcagaagcag 1380

ggtacccttt ggggagcagg aggccaataa aaagttcaga gttcat 1426



<210> 66 

<211> 739 

<212> DNA 

<213> Homo sapien 

<400> 66 



cccaaagaaa gcttggctat ttttattgat gtgtaatata caacctatgt aaatgaagtt 60

aggcctattg gtttgcaaat gcagctttaa cataattacc ttacctgtct ccttccccta 120

cccaatgctg agggacattg ctccccacct caccatcatg ccatgctttc tccccctggt 180

cataggtgat ctttccagaa cagctaacca ggtgcctggg gtctggagac cttactgctt 240

gaggagtgaa ttaagagaaa agactgcttg ctttcctcca gactttgagc cctggcctga 300

tgtagacctt tttgctctct cctccttcgt ataggtacgg ttcctggagc agcagaacaa 360

gatgctggag accaagtgga gcctcctgca gcagcagaag acggctcgaa gcaacatgga 420

caacatgttc gagagctaca tcaacaacct taggcggcag ctggagactc tgggccagga 480

gaagctgaag ctggaggcgg agcttggcaa catgcagggg ctggtggagg acttcaagaa 540

caaaagtgag caactcccac cctccaccca actgaagtca ccgtgctctc ctccacccct 600

tgggaccttg ggactaagtc cccatggccc tctgttgtgg gcagtgcagt cctatctaat 660

tagggtgacc acctgatgag gtttgctcgg aacagtctgt gtttatgccc aggttcctag 720

cacaattgtt gatagtacc 739



<210> 67 

<211> 3444 

<212> DNA 

<213> Homo sapien 
<400> 67

attggaagcg cactgcgcgg cctccaccgc gctctcggcc cgggttcggc gtgggggcgg 60

ggaggtaagt acggtaatgg gcggggccga gcgcgggcgc gctccggctt tcctcctgcg 120

cagtgctccc gtcagcgcag ggggcggggc gcctatatta cgtgcgcggc gccgctcctg 180

cgagaggacg ttgcgtgctc gctcgcgcca ggcgagtctc cgcgtctccc tcgcgaactc 240

ggtgaaagga attggcgccg ttcgacacca ggcggatccg ctctgcagca cgaacccatc 300

tccagccgca gccgcagccg ccgcccgggc cgaggagccg cctcggaaga cgagaacggc 360

gacgatcagg gtttccagga aggggaagat gagctcgggg acgaagagga aggcgcgggc 420

gacgagaacg ggcacgggga gcagcagcct caaccgccgg cgacgcagca gcaacagccc 480

caacagcagc gcggggccgc caaggaggcc gcggggaaga gcagcggccc cacctcgctg 540

ttcgcggtga cggtggcgcc gcccggggcg aggcagggcc agcagcaggc gggaggggac 600

ggcaaaacag aacagaaagg cggagataaa aagaggggtg ttaaaagacc acgagaagat 660

catggccgtg gatattttga gtacattgaa gagaacaagt atagcagagc caaatctcct 720

cagccacctg ttgaagaaga agatgaacac ttcgatgaca cagtggtttg tcttgatact 780

tataattgtg atctacattt taaaatatca agagatcgtc tcagtgcttc ttcccttaca 840

atggagagtt ttgcttttct ttgggctgga ggaagagcat cctatggtgt gtcaaaaggc 900

aaagtgtgtt ttgagatgaa ggttacagag aagatcccag taaggcattt atatacaaaa 960

gatattgaca tacatgaagt tcgtattggc tggtcactaa ctacaagtgg aatgttactt 1020

ggtgaagaag aattttctta tgggtattct ctaaaaggaa taaaaacatg caactgtgag 1080

actgaagatt atggagaaaa gtttgatgaa aatgatgtga ttacatgttt tgctaacttt 1140

gaaagtgatg aagtagaact ctcgtatgct aagaatggac aagatcttgg cgttgccttc 1200

aaaatcagta aggaagttct tgctggacgg ccactgttcc cgcatgttct ctgccacaac 1260

tgtgcagttg aatttaattt tggtcagaag gaaaagccat attttccaat acctgaagag 1320

tatactttca tccagaacgt ccccttagag gatcgagtta gaggaccaaa ggggcctgaa 1380

gagaagaaag attgtgaagt tgtgatgatg attggcttgc caggagctgg aaaaactacc 1440

tgggttacta aacatgcagc agaaaatcca gggaaatata acattcttgg cacaaatact 1500

attatggata agatgatggt ggcaggtttt aagaagcaaa tggcagatac tggaaaactg 1560

aacacactgt tgcagagagc cccccagtgt cttgggaaat ttattgagat tgctgcccga 1620

aagaagcgaa attttattct ggatcagaca aatgtgtctg ctgctgccca gaggagaaaa 1680

atgtgcctgt ttgcaggctt ccagcgaaaa gctgttgtag tttgcccaaa agatgaagac 1740

tataagcaaa gaacacagaa gaaagcagaa gtagagggga aagacctacc agaacatgcg 1800
gtcctcaaaa tgaaaggaaa ctttaccctc ccagaggtag ctgagtgctt tgatgaaata 1860

acctatgttg aacttcagaa ggaagaagcc caaaaactct tggagcaata taaggaagaa 1920

agcaaaaagg ctcttccacc agaaaagaaa cagaacactg gctcaaagaa aagcaataaa 1980

aataagagtg gcaagaacca gtttaacaga ggtggtggcc atagaggacg tggaggattc 2040

aatatgcgtg gtggaaattt cagaggagga gcccctggga atcgtggcgg atataatagg 2100

aggggcaaca tgccacagag aggtggtggc ggtggaggaa gtggtggaat cggctatcca 2160

taccctcgtg cccctgtttt tcctggccgt ggtagttact caaacagagg gaactacaac 2220

agaggtggaa tgcccaacag agggaactac aaccagaact tcagaggacg aggaaacaat 2280

cgtggctaca aaaatcaatc tcagggctac aaccagtggc agcagggtca attctggggt 2340

cagaagccat ggagtcagca ttatcaccaa ggatattatt gaatacccaa ataaaacgaa 2400

ctgatacata tttctccaaa accttcacaa gaagtcgact gttttcttta gtaggctaac 2460

tttttaaaca ttccacaaga ggaagtgcct gcgggttcct tttttagaag ctttgtgggt 2520

tgattttttt tcttttcttt tttgtacatt tttaattgca gtttaaaagt gaatcgtaag 2580

agaacctcag cattgtgcac gataagagaa tgtgtcagta tttcagggtt ctacatttta 2640

tctgtaaaat gtgacttttt ttttttttta tcacaacaga agtaaaatgt tgctttgtac 2700

ctggtgtctt ttattaagaa tttactcccc ccatttctca cagagaataa cagtcgggag 2760

tcattgtcac aatataatag aaatgttagc aaccagattc atgtaaggac taagtggtcc 2820

tcatgaattg cattaagact ctgtactgct catattacac tccatcctct ctgtagtttg 2880

ctgggtagtg gagggggtaa gctaaatcat agtttctgac aataactggg aaggtttttt 2940

cttaaaataa caatggaatt ggtataattg ggattgaaaa ctaaaacttg gaactaagat 3000

agagaagatg gagtgtatgt agaagggctg ttaaaaatgt aaaacttggt tgcattattt 3060

gtggaggctc aaacttgtga aggttaatac cataattttt ccatttgttc tgcattttga 3120

ttctgaaaag aaagctggct ttgcccattt cttattaaaa aaacttgttg taaatccagt 3180

tgtctaatgg gatctatatg aagttagcca tgtctgtatg cccttctccc acaaaatact 3240

gtataactag tgtgcttgta gtagttaact ccaccatctt tgtaagctaa tgaaattgtg 3300

agtcacccat ttatatctta atttttaatc atgtcagttc ttgaatgggt atctccttag 3360

cctgctgatt tctttttctt tctaaagaaa gtgggtggag aaattaattt agacgtttgt 3420

ttgcaataaa aagaattcat ttta 3444



<210> 68 
<211> 2081 
<212> DNA 
<213> Homo sapien
<400> 68 

gagtatatat atatgtactc caagttcaat aaggaaattt taaaaggcac tcatcagcat 60 

agtgctagtt gtaattaaag caggcatact ggcatatagg gcctttaata ttacaggaat 120 

taagatgatc ctaacgtgtt ttgtactggt cgaagcttat ccttaggtga caggttctag 180 

agtctataaa gataatttag taattttcac tttttgactc atgggattta caagaatatt 240 

taaagggctt agaaagtaaa gacttgcgaa gaaaagcttt agcaggagag attgttcaat 300 

atagaagaga aaaatatccc agtgaattcc tgaaggaagg tatggcatat ttaatagcag 360 

ccctgattta aaatagtggg aattaggtga tcctgtagat ccttccaact ttatgatttt 420 

tagagatttt atttttgccc aagtcatcag taagtatgag gacatttttt ttgtgttttt 480 

ccaagtgggt tgtttaaaat tttttccttt ttttattcct cttcccaagg caaatgaaaa 540

aaaagttgac cagcctccag aagctaaaaa gcccaaaata aaggtggtga atgttgagct 600 

gcctattgaa gccaacttgg tctggcagtt agggaaagac cttcttaaca tgtatattga 660 

gacagagggt aagatgataa tgcaagataa attggaaaaa gaaaggaatg atgctaaaaa 720 

tgcagttgag gaatatgtgt atgagttcag agacaagctg tgtggaccat atgaaaaatt 780 

tatatgtgag caggatcatc aaaatttttt gagactcctc acagaaactg aagactggct 840

gtatgaagaa ggagaggacc aagctaaaca agcatatgtt gacaagttgg aagaattaat 900 

gaaaattggc actccagtta aagttcggtt tcaggaagct gaagaacggc caaaaatgtt 960 

tgaagaacta ggacagaggc tgcagcatta tgccaagata gcagctgact tcagaaataa 1020 

ggatgagaaa tacaaccata ttgatgagtc tgaaatgaaa aaagtggaga agtctgttaa 1080 

tgaagtgatg gaatggatga ataatgtcat gaatgctcag gctaaaaaga gtcttgatca 1140 

ggatccagtt gtacgtgctc aggaaattaa aacaaaaatc aaggaattga acaacacatg 1200 

tgaacccgtt gtaacacaac cgaaaccaaa aattgaatca cccaaactgg aaagaactcc 1260 

aaatggccca aatattgata aaaaggaaga agatttagaa gacaaaaaca attttggtgc 1320 

tgaacctcca catcagaatg gtgaatgtta ccctaatgag aaaaattctg ttaatatgga 1380 

cttggactag ataaccttaa attggcctat tccttcaatt aataaaatat ttttgccata 1440 

gtatgtgact ctacataaca tactgaaact atttatattt tcttttttaa ggatatttag 1500 

aaattttgtg tattatatgg aaaaagaaaa aaagcttaag tctgtagtct ttatgatcct 1560 

aaaagggaaa attgccttgg taactttcag attcctgtgg aattgtgaat tcatactaag 1620 

ctttctgtgc agtctcacca tttgcatcac tgaggatgaa actgactttt gtcttttgga 1680 

gaaaaaaaac tgtactgctt gttcaagagg gctgtgatta aaatctttaa gcatttgttc 1740 
ctgccaaggt agttttcttg cattttgctc tccattcagc atgtgtgtgg gtgtggatgt 1800

ttataaacaa gactaagtct gacttcataa gggctttcta aaaccatttc tgtccaagag 1860 

aaaatgactt tttgctttga tattaaaaat tcaatgagta aaacaaaagc tagtcaaatg 1920 

tgttagcagc atgcagaaca aaaactttaa actttctctc tcactataca gtatattgtc 1980 

atgtgaaagt gtggaatgga agaaatgtcg atcctgttgt aactgattgt gaacactttt 2040 

atgagcttta aaataaagtt catcttatgg tgtcatttct a 2081

<210> 69 

<211> 845 

<212> DNA 

<213> Homo sapien 

<400> 69 

ccgcaaaaag caaagcagaa agccaaacaa gacccaacca acttgggtta ttcattcgat 60 

tcaaaactgt ttttggttaa atcaaacatg aaagtacaaa accgcaggaa cgcgcatctt 120 

agctcatttg atcgcttcgc cttccttggg aaatgcagtt tcgtgtcacc tgttgcagaa 180 

gatatgtagt tgatcatcta gacataattg ctgaagatga acttttggac caacttctaa 240 

gtcacacagc atcattatca gatttattac agaactcggt ggtggccact gcgcagacca 300 

gacttcgctc gtactcgtgc gcctcgcttc gcttttcctc cgcaaccatg tctgacaaac 360

ccgatatggc tgagatcgag aaattcgata agtcgaaact gaagaagaca gagacgcaag 420 

agaaaaatcc actgccttcc aaagaaacga ttgaacagga gaagcaagca ggcgaatcgt 480

aatgaggcgt gcgccgccaa tatgcactgt acattccaca agcattgcct tcttatttta 540 

cttcttttag ctgtttaact ttgtaagatg caaagaggtt ggatcaagtt taaatgactg 600 

tgctgcccct ttcacatcaa agaactactg acaacgaagg ccgcgcctgc ctttcccatc 660 

tgtctatcta tctggctggc agggaaggaa agaacttgca tgttggtgaa ggaagaagtg 720 

gggtggaaga agtggggtgg gacgacagtg aaatctagaa gtgccatttt tttttttgtt 780 

caaatgattt taattattgg aatgcacaat ttttttaata tgcaaataaa aagtttaaaa 840 

actta 845 

<210> 70 

<211> 853 

<212> DNA 

<213> Homo sapien 

<400> 70 

tctccgcctg ccgagctccg acccggcgtr ccgagtcccc ggcgcccgga ggtgacggtc 60

gcttcccctg ccctccctta gggttagtcc cgcccaatcg gctccttaat ttttcggagg 120 

ggcttctact ttggcattta tcacgggctg tctgcctcac ttacgaagca gggtgcccct 180 
agcacactgc cgggcccctg ggggaacgca cttagggaag cgaagcaaag ctaactgcat 240

ttggcagaaa ccagcgcgct ttcccacggc gccacgggag cgcaccgcta gacagagaag 300 

gcactagaag aggcggtcgg tgcatagcgt cacctcctaa actccgccca tcggaagcgc 360 

cttctaagct cgttcttccg ccagcttccc tcctcttcct ttctccgcca tcgtggtgtg 420 

ttcttgactc cgctgctcgc catgtcttct cacaagactt tcaggattaa gcgattcctg 480 

gccaagaaac aaaagcaaaa tcgtcccatt ccccagtgga ttcggatgaa aactggaaat 540 



aaaatcagct cttgaagatg atgtgtctcg gagtttgttc cttcagatgt tcagatatgt 600

ccagagtttc ttccttccag tgggttcgtg gtctcactga cttcaggaat gaagctgcat 660

acgctcacgc tccacagaaa agttctccaa gtccccactc gacccaggaa gtcctgctgt 720

cttcacctgt caccagctac tggggaggct gaggcaggag aatcgcttga acccaggagg 780

tggaggttgc tgggagccaa gccaagatca cgccactaca ctccagcctg gcaacagagt 840

gagactccgt ctc 853


<210> 71 

<211> 853 

<212> DNA 

<213> Homo sapien 




<400> 71 

tacggctagg tacgaggctg ggtgttgctc ctctcccagt cggcaccaca gcggtggctg 60

ccgggcgtgg tgtcggtggg tcggttggtt tttgtctcac cgttggtgtc cgtgccgttc 120

agttgcccgc catggctgag ctggatccgt tcggcgcccc tgccggcgcc cctggcggtc 180

ccgcgctggg gaacggagtg gccggcgccg gcgaagaaga cccggctgcg gccttcttgg 240

cgcagcaaga gagcgagatt gcgggcatcg agaacgacga ggccttcgcc atcctggacg 300

gcggcgcccc cgggccccag ccgcacggcg agccgccggg gggtccggat gctgttgatg 360

gagtaatgaa tggtgaatac taccaggaaa gtaatggtcc aacagacagt tatgcagcta 420

tttcacaagt ggatcgattg cagtcagagc ctgaaagtat ccgtaaatgg agagaagaac 480

aaatggaacg cttggaagcc cttgatgcca attctcggaa gcaagaagca gagtggaaag 540

aaaaggcaat aaaggagcta gaagaatggt atgcaagaca ggacgagcag ctacagaaaa 600

caaaagcaaa caacagggca gcagaagaag cctttgtaaa tgacattgac gagtcgtccc 660

caggcactga gtgggaacgg gtggcccggc tgtgtgactt taaccccaag tctagcaagc 720

aggccaaaga tgtctcccgc atgcgctcag tcctcatctc cctcaagcag gccgctggtg 780

ctggaggccc cgtggacctc ctggttttac cccggggcct ccgctgggtt ttgggttggc 840

ccttgttggc ttt 853

<210> 72
<211> 1666 
<212> DNA 
<213> Homo sapien 

<400> 72 

ctagttctag atcgcgggcg gccgctcgcg atctagaacc ggacgcgtgg ggcgcttgtc 60 

ctcctctccc agtcggcacc acagcggtgg ctgccgggcg tggtgtcggt gggtcggttg 120 

gtttttgtct caccgttggt gtccgtgccg ttcagttgcc cgccatggct gagctggatc 180 

cgttcggcgc ccctgccggc gcccctggcg gtcccgcgct ggggaacgga gtggccggcg 240

ccggcgaaga agacccggct gcggccttct tggcgcagca agagagcgag attgcgggca 300 

tcgagaacga cgaggccttc gccatcctgg acggcggcgc ccccgggccc cagccgcacg 360 

gcgagccgcc ggggggtccg gatgcaggaa agtaatggtc caacagacag ttatgcagct 420 

atttcacaag tggatcgatt gcagtcagag cctgaaagta tccgtaaatg gagagaagaa 480 

caaatggaac gcttggaagc ccttgatgcc aattctcgga agcaagaagc agagtggaaa 540 

gaaaaggcaa taaaggagct agaagaatgg tatgcaagac aggacgagca gctacagaaa 600 

acaaaagcaa acaacagggc agcagaagaa gcctttgtaa atgacattga cgagtcgtcc 660 

ccaggcactg agtgggaacg ggtggcccgg ctgtgtgact ttaaccccaa gtctagcaag 720 

caggccaaag atgtctcccg catgcgctca gtcctcatct ccctcaagca ggccccgctg 780 

gtgcactgaa gagccaccct gtggaaacac tacatctgca atatcttaat cctactcagt 840 

gaagctcttc acagtcattg gattaattat gttgagttct tttggaccaa acctttttgt 900 

ctttagagtt gttcattgtt tgtgattgca tgtttccttc cttcaactgt gttctccctg 960 

gcattcagag aggagggaga ggaggaagag gaaggggagg gaagcttccc aagagtagcc 1020 

tcaacctgtg cttctgtgca ttattctgag aataaatttc tgtttcaaac tgtaaaaaaa 1080 

aaaaaaaaca caaaacacaa cacacaaaaa cacaaaaaca caaaaaaaaa aacaaacaac 1140 

acacaaaaca aacacacaac acacaccaca cacaaacaaa aaacaaacac aaagaggggg 1200 

ggggcgcata tatatataat aaccaccccg cgggaggggc acaatattat catcacacac 1260 

aacaccacaa tattatgaga aaaaaaggcg cccccaatta agagaggggt gtatataata 1320 

aaaaaagaga agagggccca cttttaatat acacaaccgg ctaagagcgg agaagaaaca 1380 

aacacacgct tgtccgccag gagaactctt ttggtggaaa gacacccacc ccatctctcc 1440 

tgcgggggag gcaaacaaat caatgtggac caacaccaca cccctccaga agaggagatt 1500 

ataacccccc ccctccacgg agggatcatc aacaacacca acacagtgag agagatcata 1560 

agggtggaca caaaaaccgc gaccatacaa tcagctgatg aggagctgaa aaaaaatcat 1620 
tgtcatcaca ccgggcgttc aattctaaaa aaaaattata tagaga 1666

<210> 73 

<211> 1542 

<212> DNA 

<213> Homo sapien 

<400> 73 

ctagttctag atcgcgggcg gccgctcgcg atctagaacc ggacgcgtgg ggcgcttgtc 60

ctcctctccc agtcggcacc acagcggtgg ctgccgggcg tggtgtcggt gggtcggttg 120 

gtttttgtct caccgttggt gtccgtgccg ttcagttgcc cgccatggct gagctggatc 180 

cgttcggcgc ccctgccggc gcccctggcg gtcccgcgct ggggaacgga gtggccggcg 240

ccggcgaaga agacccggct gcggccttct tggcgcagca agagagcgag attgcgggca 300

tcgagaacga cgaggccttc gccatcctgg acggcggcgc ccccgggccc cagccgcacg 360 

gcgagccgcc ggggggtccg gatgccaatt ctcggaagca agaagcagag tggaaagaaa 420 

aggcaataaa ggagctagaa gaatggtatg caagacagga cgagcagcta cagaaaacaa 480 

aagcaaacaa cagggcagca gaagaagcct ttgtaaatga cattgacgag tcgtccccag 540

gcactgagtg ggaacgggtg gcccggctgt gtgactttaa ccccaagtct agcaagcagg 600 

ccaaagatgt ctcccgcatg cgctcagtcc tcatctccct caagcaggcc ccgctggtgc 660 

actgaagagc caccctgtgg aaacactaca tctgcaatat cttaatccta ctcagtgaag 720 

ctcttcacag tcattggatt aattatgttg agttcttttg gaccaaacct ttttgtcttt 780 

agagttgttc attgtttgtg attgcatgtt tccttccttc aactgtgttc tccctggcat 840

tcagagagga gggagaggag gaagaggaag gggagggaag cttcccaaga gtagcctcaa 900 

cctgtgcttc tgtgcattat tctgagaata aatttctgtt tcaaactgta aaaaaaaaaa 960 

aaaacacaaa acacaacaca caaaaacaca aaaacacaaa aaaaaaaaca aacaacacac 1020 

aaaacaaaca cacaacacac accacacaca aacaaaaaac aaacacaaag aggggggggg 1080 

cgcatatata tataataacc accccgcggg aggggcacaa tattatcatc acacacaaca 1140 

ccacaatatt atgagaaaaa aaggcgcccc caattaagag aggggtgtat ataataaaaa 1200 

aagagaagag ggcccacttt taatatacac aaccggctaa gagcggagaa gaaacaaaca 1260 

cacgcttgtc cgccaggaga actcttttgg tggaaagaca cccaccccat ctctcctgcg 1320 

ggggaggcaa acaaatcaat gtggaccaac accacacccc tccagaagag gagattataa 1380 

ccccccccct ccacggaggg atcatcaaca acaccaacac agtgagagag atcataaggg 1440 

tggacacaaa aaccgcgacc atacaatcag ctgatgagga gctgaaaaaa aatcattgtc 1500 

atcacaccgg gcgttcaatt ctaaaaaaaa attatataga ga 1542 
<210> 74

<211> 1958 

<212> DNA 

<213> Homo sapien 

<400> 74 

gcagacgggg tgggtaggac gctccctatc aggcagcact tccgcctccc ggggcccgcg 60 

cagctcacct ccctcacctc ccgccctacc ccagtcacga gttgttttag ggggaccgcc 120 

cctccacttg ctgattgggt agctcctgaa ccattgttgt cctctgattg gttgttccct 180 

tttcggctct gcaacaccgc ctagaccgac cggatacacg ggtagggctt ccgctttacc 240 

cgtctccctc ctggcgcttg tcctcctctc ccagtcggca ccacagcggt ggctgccggg 300 

cgtggtgtcg gtgggtcggt tggtttttgt ctcaccgttg gtgtccgtgc cgttcagttg 360 

cccgccatgg ctgagctgga tccgttcggc gcccctgccg gcgcccctgg cggtcccgcg 420 

ctggggaacg gagtggccgg cgccggcgaa gaagacccgg ctgcggcctt cttggcgcag 480 

caagagagcg agattgcggg catcgagaac gacgaggcct tcgccatcct ggacggcggc 540 

gcccccgggc cccagccgca cggcgagccg ccggggggtc cggatgctgt tgatggagta 600 

atgaatggtg aatactacca ggaaagtaat ggtccaacag acagttatgc agctatttca 660 

caagtggatc gattgcagtc agagcctgaa agtatccgta aatggagaga agaacaaatg 720 

gaacgcttgg aagcccttga tgccaattct cggaagcaag aagcagagtg gaaagaaaag 780 

gcaataaagg agctagaaga atggtatgca agacaggacg agcagctaca gaaaacaaaa 840 

gcaaacaaca gggtggcaga tgaagctttc tacaaacaac ccttcgctga cgtgattggt 900 

tatgtgcagg gcagcagaag aagcctttgt aaatgacatt gacgagtcgt ccccaggcac 960 

tgagtgggaa cgggtggccc ggctgtgtga ctttaacccc aagtctagca agcaggccaa 1020 

agatgtctcc cgcatgcgct cagtcctcat ctccctcaag caggccccgc tggtgcactg 1080 

aagagccacc ctgtggaaac actacatctg caatatctta atcctactca gtgaagctct 1140 

tcacagtcat tggattaatt atgttgagtt cttttggacc aaaccttttt gtctttagag 1200 

ttgttcattg tttgtgattg catgtttcct tccttcaact gtgttctccc tggcattcag 1260 

agaggaggga gaggaggaag aggaagggga gggaagcttc ccaagagtag cctcaacctg 1320 

tgcttctgtg cattattctg agaataaatt tctgtttcaa actgtaaaaa aaaaaaaaaa 1380 

cacaaaacac aacacacaaa aacacaaaaa cacaaaaaaa aaaacaaaca acacacaaaa 1440 

caaacacaca acacacacca cacacaaaca aaaaacaaac acaaagaggg ggggggcgca 1500

tatatatata ataaccaccc cgcgggaggg gcacaatatt atcatcacac acaacaccac 1560 

aatattatga gaaaaaaagg cgcccccaat taagagaggg gtgtatataa taaaaaaaga 1620 
gaagagggcc cacttttaat atacacaacc ggctaagagc ggagaagaaa caaacacacg 1680

cttgtccgcc aggagaactc ttttggtgga aagacaccca ccccatctct cctgcggggg 1740 

aggcaaacaa atcaatgtgg accaacacca cacccctcca gaagaggaga ttataacccc 1800 

ccccctccac ggagggatca tcaacaacac caacacagtg agagagatca taagggtgga 1860

cacaaaaacc gcgaccatac aatcagctga tgaggagctg aaaaaaaatc attgtcatca 1920 

caccgggcgt tcaattctaa aaaaaaatta tatagaga 1958 

<210> 75
<211> 1233
<212> DNA
<213> Homo sapien

<220>

<221> misc_feature
<222> (1232)..(1232)
<223> n=a, c, g or t

<400> 75 

gcagacgggg tgggtaggac gctccctatc aggcagcact tccgcctccc ggggcccgcg 60 

cagctcacct ccctcacctc ccgccctacc ccagtcacga gttgttttag ggggaccgcc 120 

cctccacttg ctgattgggt agctcctgaa ccattgttgt cctctgattg gttgttccct 180 

tttcggctct gcaacaccgc ctagaccgac cggatacacg ggtagggctt ccgctttacc 240 

cgtctccctc ctggcgcttg tcctcctctc ccagtcggca ccacagcggt ggctgccggg 300 

cgtggtgtcg gtgggtcggt tggtttttgt ctcaccgttg gtgtccgtgc cgttcagttg 360 

cccgccatgg ctgagctgga tccgttcggc gcccctgccg gcgcccctgg cggtcccgcg 420 

ctggggaacg gagtggccgg cgccggcgaa gaagacccgg ctgcggcctt cttggcgcag 480 

caagagagcg agattgcggg catcgagaac gacgaggcct tcgccatcct ggacggcggc 540 

gcccccgggc cccagccgca cggcgagccg ccggggggtc cggatgctgt tgatggagta 600 

atgaatggtg aatactacca ggaaagtaat ggtccaacag acagttatgc agctatttca 660

caagtggatc gattgcagtc agagcctgaa agtatccgta aatggagaga agaacaaatg 720 

gaacgcttgg aagcccttga tgccaattct cggaagcaag aagcagagtg gaaagaaaag 780 

gcaataaagg agctagaaga atggtatgca agacaggacg agcagctaca gaaaacaaaa 840

gcaaacaaca gggcagcaga agaagccttt gtaaatgaca ttgacgagtc gtccccaggc 900 

actgagtggg aacgggtggc ccggctgtgt gactttaacc ccaagtctag caagcaggcc 960 

aaagatgtct cccgcatgcg ctcagtcctc atctccctca agcaggcccg kkggtgatga 1020 

rragcacctg cggaamacta atttgatatt tatctatcag ggaagtttaa gcatgatata 1080 
tgttgaggtt ttggagaacc tttgttcgag tgtcttgttg gtggggttct ctaagggttc 1140

cggtaagagg gggagaaagg gggcaaaacg ttggtttaaa tttgggagaa aaaaaaaggg 1200 

tttggggtgt tttgtggggt tgaaatttat cna 1233

<210> 76 
<211> 1262 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1262)..(1262)
<223> n=a, c, g or t 

<400> 76 

gcagacgggg tgggtaggac gctccctatc aggcagcact tccgcctccc ggggcccgcg 60 

cagctcacct ccctcacctc ccgccctacc ccagtcacga gttgttttag ggggaccgcc 120 

cctccacttg ctgattgggt agctcctgaa ccattgttgt cctctgattg gttgttccct 180 

tttcggctct gcaacaccgc ctagaccgac cggatacacg ggtagggctt ccgctttacc 240 

cgtctccctc ctggcgcttg tcctcctctc ccagtcggca ccacagcggt ggctgccggg 300

cgtggtgtcg gtgggtcggt tggtttttgt ctcaccgttg gtgtccgtgc cgttcagttg 360 

cccgccatgg ctgagctgga tccgttcggc gcccctgccg gcgcccctgg cggtcccgcg 420 

ctggggaacg gagtggccgg cgccggcgaa gaagacccgg ctgcggcctt cttggcgcag 480 

caagagagcg agattgcggg catcgagaac gacgaggcct tcgccatcct ggacggcggc 540 

gcccccgggc cccagccgca cggcgagccg ccggggggtc cggatgctgt tgatggagta 600 

atgaatggtg aatactacca ggaaagtaat ggtccaacag acagttatgc agctatttca 660 

caagtggatc gattgcagtc agagcctgaa agtatccgta aatggagaga agaacaaatg 720 

gaacgcttgg aagcccttga tgccaattct cggaagcaag aagcagagtg gaaagaaaag 780 

gcaataaagg agctagaaga atggtatgca agacaggacg agcagctaca gaaaacaaaa 840

gcaaacaaca gggcagcaga agaagccttt gtaaatgaca ttgacgagtc gtccccaggc 900 

actgagtggg aacgggtggc ccggctgtgt gactttaacc ccaagtctag caagcgggca 960 

agatgtttcc cgatggctca gtctcattcc tcaaggaggc ccgtgtgcct gagagccctg 1020 

tggaacctac attgatattt atctatcagt gagtctcaag tcatgttatt atgttgagtt 1080 

ctttggccaa ccttttgttt tagagtgtct ggttgggatg ctggtcctct caaggtttcc 1140

ggctcagagg ggggggaggg agggggggcc cagavccccg gtgggtttga aattgtggga 1200 

aaaaacgttt kggrrgakgg gtgtcctttt gcttagcctt ttatgggccg gatctataac 1260 
<210> 77
<211> 1451 
<212> DNA 
<213> Homo sapien 

<400> 77 

agctcaccct ggtcagccta gcactctgac ctagcagtca acatgaaggc tctcattgtt 60 

ctggggcttg tcctcctttc tgttacggtc cagggcaagg tctttgaaag gtgtgagttg 120 

gccagaactc tgaaaagatt gggaatggat ggctacaggg gaatcagcct agcaaactgg 180 

atgtgtttgg ccaaatggga gagtggttac aacacacgag ctacaaacta caatgctgga 240 

gacagaagca ctgattatgg gatatttcag atcaatagcc gctactggtg taatgatggc 300 

aaaaccccag gagcagttaa tgcctgtcat ttatcctgca gtgctttgct gcaagataac 360

atcgctgatg ctgtagcttg tgcaaagagg gttgtccgtg atccacaagg cattagagca 420 

tgggtggcat ggagaaatcg ttgtctttcc ttcttcagct cattttgtct ctctcacatt 480 

aagggagtag gaattaagtg aaaggtcaca ctaccattat ttccccttca aacaaataat 540 

atttttacag aagcaggagc aaaatatggc ctttcttcta agagatataa tgttcactaa 600 

tgtggttatt ttatattaag cctacaacat ttttcagttt gcaaatagaa ctaatactgg 660 

tgaaaattta cctaaaacct tggttatcaa atacatctcc agtacattcc gttctttttt 720 

tttttgagac agtctcgctc tgtcgcccag gctggagtgc agtggcgcaa tctcggctca 780 

ctgcaacctc cacctcccgg gttcacgcca ttctcctgcc tcagcctccc gagtagctgg 840 

gattacgggc gcccgccacc acgcccggct aattttttgt atttttagta gagacagggt 900 

ttcaccgtgt tagccaggat ggtctcgatc tcctgacctt gtgatccacc cacctcggcc 960 

tcccaaagtg ctgggattac aggcgtgagc cactgcgccc ggccacattc agttcttatc 1020 

aaagaaataa cccagactta atcttgaatg atacgattat gcccaatatt aagtaaaaaa 1080 

tataagaaaa ggttatctta aatagatctt aggcaaaata ccagctgatg aaggcatctg 1140 

atgccttcat ctgttcagtc atctccaaaa acagtaaaaa taaccacttt ttgttgggca 1200 

atatgaaatt tttaaaggag tagaatacca aatgatagaa acagactgcc tgaattgaga 1260 

attttgattt cttaaagtgt gtttctttct aaattgctgt tccttaattt gattaattta 1320 

attcatgtat tatgattaaa tctgaggcag atgagcttac aagtattgaa ataattacta 1380 

attaatcaca aatgtgaagt tatgcatgat gtaaaaaata caaacattct aattaaaggc 1440 

tttgcaacac a 1451 
<210> 78
<211> 1259 
<212> DNA 
<213> Homo sapien 

<400> 78 

gtctgaggga ggggcctgga ctccgggctc tgagcctact ggggcacagg cctgtgagtt 60 

gaggcccctt ctcatcatcc ccctcctcag gtggtctaca tagcctgctc cttcaccacg 120 

gtctggttga tttatagcaa gttcaaagct acttacgatg ggaaccatga cacgttcaga 180 

gtggagttcc tggtcgttcc cacagccatt ctggcgttcc tggtcaatca tgacttcacc 240 

cctctggaga tcctctggac cttctccatc tacctggagt cagtggccat cttgccgcag 300 

ctgttcatgg tgagcaagac cggcgaggcg gagaccatca ccagccacta cttgtttgcg 360 

ctaggcgttt accgcacgct ctatctcttc aactggatct ggcgctacca tttcgagggc 420 

ttcttcgacc tcatcgccat tgtggcaggc ctggtccaga cagtcctcta ctgcgatttc 480 

ttctacctct atatcaccaa agtcctaaag gggaagaagt tgagtttgcc ggcatagccc 540 

cggtcctctc catctctctc ctcggcagca gcgggaggca gaggaaggcg gcagaagatg 600 

aagagctttc ccatccaggg gtgacttttt taagaaccca cctcttgtgc tccccatccc 660 

gcctcctgcc gggtttcagg gggacagtgg aggatccagg tcttggggag ctcaggactt 720 

gggctgtttg tagttttttg ccttttagac aagaaaaaaa aatctttcca ctctttagtt 780 

tttgattctg atgactcgtt tttcttctac tctgtggccc caatttttat aaagtgtttt 840 

tgagtgtcct atgggccggg gcagggtcca agatcttttc ccttccccag gcccctcggc 900 

tccctcccag atcccacccc cagccccact ggttgccaaa cactaaatct gccgacaccc 960 

atctgcccca cctcctgcca tggccatgaa ccgcgacccc cactaaattt ctagattggg 1020 

gatagggaga aagggaggcc caggaaggtc tcccctgatt ttttttcata gtaatttttt 1080 

tccccagagt ttgaattttt tggtcttctc ctggtttttt ggcaaattag gggggcccgg 1140 

ggctcaagtg cgggaagggg gctggcccga ggatcccatg gctctcacac catgtttttg 1200 

tacagaactg atggttgaat ctttgttctc ttgaaataaa cagaagaaaa tgaaacctt 1259 

<210> 79 
<211> 3882 
<212> DNA 
<213> Homo sapien 



<400> 79 

gaattcgtgg gtcgtgctgc gtcgacaacg gtagtgacgc gtattgcctg gaggatggcg 60

gacgccggca ttcgccgcgt ggttcccagc gacctgtatc ccctcgtgct cggcttcctg 120

cgcgataacc aactctcaga ggtggccaat aagttcgcca aagcgacagg agctacacag 180

caggatgcca atgcctcttc cctcttagac atctatagct tctggctcaa caggtctgcc 240

aaggtcccag agcgaaagtt acaggcaaat ggaccagtgg ctaagaaagc taagaagaag 300

gcctcatcca gtgacagtga ggacagcagc gaggaggagg aggaagttca agggcctcca 360

gcaaagaagg ctgctgtacc tgccaagcga gtcggtctgc ctcctgggaa ggctgcagcc 420

aaagcatcag agagtagcag cagtgaagag tccagtgatg atgatgatga ggaggaccaa 480

aagaaacagc ctgtccagaa gggagttaag ccccaagcca aggcagccaa agctcctcct 540

aagaaggcca agagctctga ttctgattct gactcaagct ccgaggatga gccaccaaag 600

aaccagaagc caaagataac acctgtgaca gttaaagctc agactaaagc ccctcccaaa 660

ccagctcgag cagcacctaa aatagccaat ggtaaagcag ccagtagcag cagtagcagc 720

agcagcagca gtagcagtga tgactcagag gaggagaagg cagcagccac ccccaagaag 780

gtctggacca taacttctgt cagggcagag gtgaccaggg tgtgatgtgt gtgtgtcttc 840

cttccctggc aggattggtt gggtcagcat tttcctgtgg taatcaattt tctttctaat 900

gcagactgta cctaaaaagc aagttgtggc caaggcccca gtgaaagcag ctaccacccc 960

tacccggaag agttctagca gtgaggattc ctccagtgac gaggaagagg agcaaaaaaa 1020

acccatgaaa aataaaccag gtccctacag ttcagtcccc ccgccttctg ctcccccacc 1080

aaagaagtct ctgggaaccc agcctcccaa gaaggctgtg gagaagcagc agcctgtgga 1140

aagcagtgaa gacagcagtg atgagtctga ttcaagttct gaagaagaga agaaaccccc 1200

aactaaggca gtagtctcta aagcaaccac taaaccacct ccagcaaaga aagcagcaga 1260

gagctcttca gacagctcag actctgacag ctctgaggat gatgaagctc cttctaagcc 1320


agctggfcacc 


accaagaatt 


cttcaaataa 


gccagctgtc 


accaccaagt 


cacctgcagt 


1380 


gaagccagct 


gcagccccca 


agcaacctgt 


gggcggtggc 


cagaagcttc 


tgacgagaaa 


1440 


ggctgacagc 


agctccagtg 


aggaagagag 


cagctccagt 


gaggaggaga 


agacaaagaa 


1500 


gatggtggcc 


accactaagc 


ccaaggcgac 


tgccaaagca 


gctctatctc 


tgcctgccaa 


1560 


gcaggctcct 


ca gggtagta 


gggacagcag 


ctctgattca 


gacagctcca 


gcagtgagga 


1620 


ggaggaagag 


aagacatcta 


agtctgcagt 


taagaagaag 


ccacagaagg 


tagcaggagg 


1680 


tgcagcccct 


tccaagccag 


cctctgcaaa 


gaaaggaaag 


gctgagagca gcaacagttc 


1740 


ttcttctgat 


gactccagtg 


aggaagagga 


agagaagctc 


aa gggcaagg 


gctctccaag 


1800 


accacaagcc 


cccaaggcca 






gcccagaatg 


gaaaagcagc 


lODU 


taagaacagt 


gaggaggagg 


aagaagaaaa 


gaaaaaggcg 


gcagtggtag 


tttccaaatc 


1920 


1 aggttcatta 


aagaagcgga 


agcagaatga 


ggctgccaag 


gaggcagaga 


ctcctcaggc 


1980 


caagaagata 


aagcttcaga 


cccctaacac 


atttccaaaa 


aggaagaaag 


gagaaaaaag 


2040 
ggcatcatcc ccattccgaa gggtcaggga ggaggaaatt gaggtggatt cacgagttgc 2100

ggacaactcc tttgatgcca agcgaggtgc agccggagac tggggagagc gagccaatca 2160 

ggttttgaag ttcaccaaag gcaagtcctt tcggcatgag aaaaccaaga agaagcgggg 2220 

cagctaccgg ggaggctcaa tctctgtcca ggtcaattct attaagtttg acagcgagtg 2280

acctgaggcc atcttcggtg aagcaagggt gatgatcgga gactacttac tttctccagt 2340

ggacctggga accctcaggt ctctaggtga gggtcttgat gaggacagaa gtttagagta 2400 

ggtcctaaga ctttacagtg taacatcctc tctggtcctt ttctgtgttc ctagttttgt 2460 

acagacttgt ttttgagtgt tgagtagcag ggacaaaata agggaatgtt attttttaag 2520

aaaattcatt ttcattgttg tctccttcct tttctgtgaa agtcctcata ctgagaaatt 2580 

tgtatatttt atattaaatc acttactatt gatttttgtt gtgattttca aaggtggatt 2640 

cccacagata aaatcttggc tattgcccaa aacatagtaa agggtcacgt gtgacttttt 2700 

ataataggaa gaaaattctg cctttgtgag tgcacatgtc cacatttcat ccctccttcc 2760 

ctcaaaaccc tagagagggg cattaaagaa ttgttgatgt atatgcaatg tctgttaagc 2820 

atgcactatg tatttcatcc tcatttattg ggtctgggac tgaagttttt agccagcatg 2880 

gacctaacct actttttggg ataaaattct ctgttttgtt acaggcaaaa ttctggtatg 2940 

gcgtgaatgc catgggtcat tctgaatata tttttttctg taattttatc attacacgat 3000 

gtttgcaata cgtgctttgt tttttaattt gaaagcaaac ttttctactg ttgaaagaca 3060 

ttttttgaca acttgaccct tcctagtatt gagttctaag ttgaggactg catcttctcg 3120 

ttttttacag tatagagaac aaaatgacat gagtttgaaa aatacatatc acttggtatt 3180 

gctgtcttgg ttgcagtggt gatacagaat tggtttcatt aattcctaca tggttgagaa 3240 

tcactgatca agaaagtggg gggaaaaaaa acaaacgtta aaacctcaat cctcagtagg 3300 

aaggtagatt acattaggtg aaattatagg taatctatgt atgtgctaat ggggttggaa 3360 

agaaccttac agagcatatt acctgataaa ctggagtggg tttgggagaa caaactaata 3420 

ggattattgt gtctcctagt tggtacctgg gagcaattga catgccccct tcagaacctt 3480 

aactgttagt agcagtggct gtaacaacac aaaccagtga ccagagataa cagcttttag 3540 

gccaagctgg cctgacggta tggctgcagg aagtgactga gcagtagcgg tactcagcca 3600 

gaccaagacg gagagggaag agtccacagc tttctggaag ctaaggcatt ctggtggtag 3660 

aaaagtgtgc cccaagcctt catggacgag ttataggtct taagattagt ctcctcttgt 3720 

ttggattcca tacttgctaa ataacctgat aataacctgg ttttccatgt aactgcctct 3780 

aggaagaaaa tgtactgttc atgctgacac agatatttca gtctgcatgg taaaagttct 3840 
aaatcttact acaaaataat aaactggctg gtttataatg tg 3882

<210> 80 
<211> 2467 
<212> DNA 
<213> Homo sapien 

<400> 80 

ctcagtggcc acctccttct caagcaaagt gggcccacgc agccagagag gacagccttc 60 

ctgagtaatc ctcagcccct gattttgcaa acctgaagca ctatcaaaaa cagcagagtc 120 

ttccaagttt atgcagcact tctgacccag acacacctct tggggccccg agcagctctg 180

attcagacag ctccagcagt gaggaggagg aagagaagac atctaagtct gcagttaaga 240 

agaagccaca gaaggtagca ggaggtgcag ccccttccaa gccagcctct gcaaagaaag 300

gaaaggctga gagcagcaac agttcttctt ctgatgactc cagtgaggaa gaggaagaga 360 

agctcaaggg caagggctct ccaagaccac aagcccccaa ggccaatggc acctctgcac 420 

tgactgccca gaatggaaaa gcagctaaga acagtgagga ggaggaagaa gaaaagaaaa 480

aggcggcagt ggtagtttcc aaatcaggtt cattaaagaa gcggaagcag aatgaggctg 540 

ccaaggaggc agagactcct caggccaaga agataaagct tcagacccct aacacatttc 600 

caaaaaggaa gaaaggagaa aaaagggcat catccccatt ccgaagggtc agggaggagg 660

aaattgaggt ggattcacga gttgcggaca actcctttga tgccaagcga ggtgcagccg 720 

gagactgggg agagcgagcc aatcaggttt tgaagttcac caaaggcaag tcctttcggc 780 

atgagaaaac caagaagaag cggggcagct accggggagg ctcaatctct gtccaggtca 840 

attctattaa gtttgacagc gagtgacctg aggccatctt cggtgaagca agggtgatga 900 

tcggagacta cttactttct ccagtggacc tgggaaccct caggtctcta ggtgagggtc 960 

ttgatgagga cagaagttta gagtaggtcc taagacttta cagtgtaaca tcctctctgg 1020 

tccttttctg tgttcctagt tttgtacaga cttgtttttg agtgttgagt agcagggaca 1080 

aaataaggga atgttatttt ttaagaaaat tcattttcat tgttgtctcc ttccttttct 1140 

gtgaaagtcc tcatactgag aaatttgtat attttatatt aaatcactta ctattgattt 1200 

ttgttgtgat tttcaaaggt ggattcccac agataaaatc ttggctattg cccaaaacat 1260 

agtaaagggt cacgtgtgac tttttataat aggaagaaaa ttctgccttt gtgagtgcac 1320 

atgtccacat ttcatccctc cttccctcaa aaccctagag aggggcatta aagaattgtt 1380

gatgtatatg caatgtctgt taagcatgca ctatgtattt catcctcatt tattgggtct 1440 

gggactgaag tttttagcca gcatggacct aacctacttt ttgggataaa attctctgtt 1500 

ttgttacagg caaaattctg gtatggcgtg aatgccatgg gtcattctga atatattttt 1560 
ttctgtaatt ttatcattac acgatgtttg caatacgtgc tttgtttttt aatttgaaag 1620

caaacttttc tactgttgaa agacattttt tgacaacttg acccttccta gtattgagtt 1680

ctaagttgag gactgcatct tctcgttttt tacagtatag agaacaaaat gacatgagtt 1740

tgaaaaatac atatcacttg gtattgctgt cttggttgca gtggtgatac agaattggtt 1800

tcattaattc ctacatggtt gagaatcact gatcaagaaa gtggggggaa aaaaaacaaa 1860

cgttaaaacc tcaatcctca gtaggaaggt agattacatt aggtgaaatt ataggtaatc 1920

tatgtatgtg ctaatggggt tggaaagaac cttacagagc atattacctg ataaactgga 1980

gtgggtttgg gagaacaaac taataggatt attgtgtctc ctagttggta cctgggagca 2040

attgacatgc ccccttcaga accttaactg ttagtagcag tggctgtaac aacacaaacc 2100

agtgaccaga gataacagct tttaggccaa gctggcctga cggtatggct gcaggaagtg 2160

actgagcagt agcggtactc agccagacca agacggagag ggaagagtcc acagctttct 2220

ggaagctaag gcattctggt ggtagaaaag tgtgccccaa gccttcatgg acgagttata 2280

ggtcttaaga ttagtctcct cttgtttgga ttccatactt gctaaataac ctgataataa 2340

cctggttttc catgtaactg cctctaggaa gaaaatgtac tgttcatgct gacacagata 2400

tttcagtctg catggtaaaa gttctaaatc ttactacaaa ataataaact ggctggttta 2460

taatgtg 2467


<210> 81
<211> 7144
<212> DNA
<213> Homo sapien

<400> 81

gcgctcgggc caatgagaga taggctcatg gcgcggctgt agctggaact gaccaatagg 60

agggcgggct gggggcggag gcccaggttc caaacgctcc gcggcgccat gggctgaaaa 120

ctcaaccgag atggaatctc ccagtctgaa ccggtttcaa ccggttccga gtttgaggca 180

ctaggaggag ggggagaagc ggctgcagcg gccgcggcag gagcagcggg agctacagca 240

tcagcaagag caacagtagc tacagccccg gcggcggtgc ctgttccagt ctttgctgct 300

gcagtccgtg caaccaccca gagggggagg ggggaaccac cagtcgctga ggaacaagag 360

aaggggggaa agtttaggcg agccttgggg ggggggggcc agcgccggag ccgcgtgaga 420

gagggagccg tgttttggta ggggggagtc ggactgcaac tggcagcaga gcgtctcccc 480

ggccgtgtgg actctacacc tcctactcct gccgcttctg ctgctgcctg tggctggagg 540

gtccccctgg ggctgaatct ttgggacttg accccgttcc ctcccccttc cctcactccc 600

cagccgggcg ggagcattta ttccccagat taattcccct tttggggggg ggcgggtgtg 660
tgtgtgtgtg tgtgtgtgtg tgtgtgtgtt gggggaagcg tccctgaaat agtaaatatt 720

attgagctct ttttgccctt ttcctgtccg tttttttaat ttcctttttt gaggtgggaa 780 

aactgaaacc caccttgatt cgtcccctct cccccctccc caccttccct cgccctaatc 840 

ccccaacgag gaaggaagga gcagttggtt caatctctgg taatctatgc cagcaattat 900 

gacaatgtta gcagaccatg cagctcgtca gctgcttgat ttcagccaaa aactggatat 960 

caacttatta gataatgtgg tgaattgctt ataccatgga gaaggagccc agcaaagaat 1020

ggctcaagaa gtactgacac atttaaagga gcatcctgat gcttggacaa gagtcgacac 1080 

aattttggaa ttttctcaga atatgaatac gaaagtaagc aagtggtggt tttaaaatct 1140 

gttatttctt ttcttccagt cataaacttt gaggaaggta tctcattttt agtattagca 1200 

tttgttatct caaattattt tatattgtta atttgtttca aaatttagtc ttaagcatat 1260 

aattaattct tgcaaatagt cctaagaatt atgaatgaag ttttgtaatg ggactagtat 1320 

gcaaggtggc ttttttggtt gtctgatttt gtttttttct ttttagtata tgtatacagg 1380 

tctaagtagc ataaattctg aatgttattc agagattaaa cataacatta aggaattatt 1440 

ttgggcaagt aatcaatata atacaggcca ggcacctctg acaccaagtt gtgtggacag 1500 

cttaaaaccc tactccatgg cattgggctt ctaatgggac agcactcagt ttttatttac 1560 

aatggaaaat gttataattc tggtcctttt ttaagtttga acagaagggt tgatcaaaat 1620 

gtgttttgtc tgttttaggc ttgatgattc acgcttctct ttaaactgcc ttaaagtaat 1680 

aaatactatg gcattctgtt taatacacga aaggtttcca cttgatatac attgtgttaa 1740 

aacagaagat agccatattc acatctgttt ctgaaaccag actttgttaa ttaacagttt 1800

taaaatatta attagttttg acatcttgta aattttaggt caaaaataca gtccagaact 1860 

tcctcaaagt gctaacctta agtaacataa cagtgtttgc ctgtttgctt gcctggtaaa 1920 

ttttgctttt tatactgaca gacattacta atgtttaagt tgatttatca acagtatttt 1980 

taaaaactca aatttgtctt gtgaaaaatg agatgattag aggaacagaa agaaagcaac 2040 

aggtaaagaa ttcttcaatt tacataaatt acaaatgatt tggtgctaaa tttatgcact 2100

atgacatttg gcttttatgc aatcattttt ttttctttaa agatgttaac actacctgat 2160 

attaatcata atgttccatt atgttgattg cttgtaatag gaagaaaagg ggtgtcttcc 2220

tgtgcaactg acacagctag gctttgacgg caggcctcca caagtactat ggactacaaa 2280 

ttttggaaaa tgtgataaaa acaaggtgga agattcttcc aaggaaccag tgcgaaggaa 2340 

taaaaaaata cgttgttggc ctcattatca agacgtcatc tgacccaact tgtgtagaga 2400 

aagaaaaggt gtatatcgga aaattaaata tgatccttgt tcagatactg aaacaagaat 2460 

ggcccaaaca ttggccaact tttatcagtg atattgttgg agcaagtagg accagcgaaa 2520 
gtctctgtca aaataatatg gtgattctta aactcttgag tgaagaagta tttgatttct 2580

ctagtggaca gataacccaa gtcaaatcta agcatttaaa agacagcatg tgcaatgaat 2640 

tctcacagat atttcaactg tgtcagtttg taatggaaaa ttctcaaaat gctccacttg 2700 

tacatgcaac cttggaaaca ttgctcagat ttctgaactg gattcccctg ggatatattt 2760 

ttgagaccaa attaatcagc acattgattt ataagttcct gaatgttcca atgtttcgaa 2820 

atgtctctct gaagtgcctc actgagattg ctggtgtgag tgtaagccaa tatgaagaac 2880 

aatttgtaac actatttact ctgacaatga tgcaactaaa gcagatgctt cctttaaata 2940 

ccaatattcg acttgcgtac tcaaatggaa aagatgatga acagaacttc attcaaaatc 3000 

tcagtttgtt tctctgcacc tttcttaagg aacatgatca acttatagaa aaaagattaa 3060 

atctcaggga aactcttatg gaggcccttc attatatgtt gttggtatct gaagtagaag 3120 

aaactgaaat ctttaaaatt tgtcttgaat actggaatca tttggctgct gaactctata 3180 

gagagagtcc attctctaca tctgcctctc cgttgctttc tggaagtcaa cattttgatg 3240 

ttcctcccag gagacagcta tatttgccca tgttattcaa ggtaacagag cggttggttg 3300

agtgttcttc ctgttgcata ctgtggtttt gaggtctgaa tccaaatact tctaatctgt 3360 

gtaaataaat tagctataaa aagagaaccc aacaacttct ccatgagtgt ggaaaactag 3420 

aacatgaaag gagttgagtc tagaaccttg attctcaaga gtgtggtcct tctctcagta 3480 

tcaacattgg ttgtgatttc gttaggcaaa ttcattggcc acctgccaat ctactaaacc 3540 

agagtctagg aatgagacac aggaaactcc tgtaacagaa gttggttaaa aaaatcacat 3600 

taaaacacac ttaaataatt ataaagccat ttttgtagaa ttacagtgaa aaaaaatttt 3660 

ttcttttgga gacagggtct tgctctgtgg ctcaggttgg agtgcagtgg cgtggtcata 3720 

gctcactaca atcttgaact cctagactaa agcaattctc ttgtgtcagc ctcccaagta 3780 

attggttgca ggcacacacc accatgccta gcaaatttta aattttttag ttttacttca 3840 

tttatattta catagttttt tctgttgtgc tttaagcccc agtattttta ttgttttaga 3900 

gattgggtgt cattttgtgt tgctcaagct gctctcaaat tcctggccac aacagttctt 3960 

tttgcctcgg ccctccggaa tagttggaat tataggcatg aatccctaca catggctggc 4020 

ttgtggtttt tttggttttg tttgttttta actaagcttc tacactaatg atttcagtga 4080 

tctaacgatt tttttcttga ttgatatatt gttgagtata ggtccgttta ttaatggtta 4140 

gtcgaatggc taaaccagag gaagtattgg ttgtagagaa tgatcaagga gaagttgtga 4200 

gagaattcat gaaggataca gattccataa atttgtataa gaatatgagg gaaacattgg 4260 

gtaagttaat aaatactgtt aatctttacc tttataaatc tattctgggg tggaaataca 4320 
ttttaggaaa tgtacttgta gtttctaaat attttctaaa tgtattgtta ctcatcaata 4380

aaattctgct cattatttgc atttgaaacc atgtttttct ttgtagttta tcttactcat 4440

ctggattatg tagatacaga aagaataatg acagagaagc ttcacaatca agtgaatggt 4500 

acagagtggt catggaaaaa tttgaataca ttgtgttggg caataggctc cattagtgga 4560 

gcaatgcatg aagaggacga aaaacgattt cttgttactg ttataaagga tctattagga 4620 

ttatgtgaac agaaaagagg caaagataat aaagctatta ttgcatcaaa tatcatgtac 4680 

atagtaggtc aatacccacg ttttttgaga gctcactgga aatttctgaa gactgtagtt 4740 

aacaagctgt tcgaattcat gcatgagacc catgatggag tccaggatat ggcttgtgat 4800 

actttcatta aaatagccca aaaatgccgc aggcatttcg ttcaggttca ggttggagaa 4860 

gtgatgccat ttattgatga aattttgaac aacattaaca ctattatttg tgatcttcag 4920 

cctcaacagg ttcatacgtt ttatgaagct gtggggtaca tgattggtgc acaaacagat 4980 

caaacagtac aagaacactt gatagaaaag tacatgttac tccctaatca agtgtgggat 5040 

agtataatcc agcaggcaac caaaaatgtg gatatactga aagatcctga aacagtcaag 5100 

cagcttggta gcattttgaa aacaaatgtg agagcctgca aagctgttgg acaccccttt 5160 

gtaattcagc ttggaagaat ttatttagat atgcttaatg tatacaagtg cctcagtgaa 5220 

aatatttctg cagctatcca agctaatggt gaaatggtta caaagcaacc attgattaga 5280 

agtatgcgaa ctgtaaaaag ggaaacttta aagttaatat ctggttgggt gagccgatcc 5340

aatgatccac agatggtcgc tgaaaatttt gttccccctc tgttggatgc agttctcatt 5400 

gattatcaga gaaatgtccc agctgctaga gaaccagaag tgcttagtac tatggccata 5460 

attgtcaaca agttaggggg acatataaca gctgaaatac ctcaaatatt tgatgctgtt 5520 

tttgaatgca cattgaatat gataaataag gactttgaag aatatcctga acatagaacg 5580 

aactttttct tactacttca ggctgtcaat tctcattgtt tcccagcatt ccttgctatt 5640 

ccacctacac agtttaaact tgttttggat tccatcattt gggctttcaa acatactatg 5700 

aggaatgtcg cagatacggg cttacagata ctttttacac tcttacaaaa tgttgcacaa 5760 

gaagaagctg cagctcagag tttttatcaa acttattttt gtgatattct ccagcatatc 5820 

ttttctgttg tgacagacac ttcacatact gctggtttaa caatgcatgc atcaattctt 5880 

gcatatatgt ttaatttggt tgaagaagga aaaataagta catcattaaa tcctggaaat 5940 

ccagttaaca accaaatctt tcttcaggaa tatgtggcta atctccttaa gtcggccttc 6000 

cctcacctac aagatgctca agtaaagctc tttgtgacag ggcttttcag cttaaatcaa 6060 

gatattcctg ctttcaagga acatttaaga gatttcctag ttcaaataaa ggaatttgca 6120 

ggtgaagaca cttctgattt gtttttggaa gagagagaaa tagccctacg gcaggctgat 6180 
gaagagaaac ataaacgtca aatgtctgtc cctggcatct ttaatccaca tgagattcca 6240

gaagaaatgt gtgattaaaa tccaaattca tgctgttttt tttctctgca actcgttagc 6300

agaggaaaac agcatgtggg tatttgtcga ccaaaatgat gccaatttgt aaattaaaat 6360

gtcacctagt ggcccttttt cttatgtgtt tttttgtata agaaattttc tgtgaaatat 6420

ccttccattg tttaagcttt tgttttggtc atctttattt agtttgcatg aagttgaaaa 6480

ttaaggcatt tttaaaaatt ttacttcatg cccatttttg tggctgggct ggggggagga 6540

ggcaaattcg atttgaacat atacttgtaa ttctaatgca aaattataca atttttcctg 6600

taaacaatac caatttttaa ttagggagca ttttccttct agtctatttc agcctagaag 6660

aaaagataat gagtaaaaca aattgcgttg tttaaaggat tatagtgctg cattgtctga 6720

agttagcacc tcttggactg aatcgtttgt ctagactaca tgtattacaa agtctctttg 6780

gcaagattgc agcaagatca tgtgcatatc atcccattgt aaagcgactt caaaaatatg 6840

ggaacacagt tagttatttt tacacagttc tttttgtttt tgtgtgtgtg tgctgtcgct 6900

tgtcgacaac agctttttgt tttcctcaat gaggagtgtt gctcatttgt gagccttcat 6960


l_Q.CJ.V_. L.Cy QCiy L-VJ CLCLCZ LmMUL 


adaaaLaL L. L. 








TATA 


caactcatta aaaaacaaaa caactctggc ttttgagatg acttatacta atttacattg 7080

tttaccaagc tgtagtgctt taagaacact acttaaaaag caaaataaac ttggtttaca 7140

ttta 7144


<210> 82 
<211> 1001 
<212> DNA 
<213> Homo sapien 












<400> 82

gtgcccgatc gcagcaacta ttccggcagc tatcattgct tttgacgata aatcgtgtcc 60

ctccagtaca atcatagaga tatgtagctg cactacctta cgcatttcca gatccccgat 120

cgcatctcta ctgtccgtcg ctttacattg ctgctgtcct tcctgcttag acgcctggtg 180

actctcattt cgcagcaggc cacgctgctg gcctccaatg aagcctttaa aaagcaggcg 240

gagagtgcta gtgaggcggc caagaagtac atggaggaga atgaccagct caagaaggga 300

gctgctgttg acggaggcaa gttggatgtc gggaatgctg aggtgaagtt ggaggaagag 360

aacaggagcc tgaaggctga cctgcagaag ctaaaggacg agctggccag cactaagcaa 420

aaactagaga aagctgaaaa ccaggttctg gccatgcgga agcagtctga gggcctcacc 480

aaggagtacg accgcttgct ggaggagcac gcaaagctgc aggctgcagt agatggtccc 540

atggacaaga aggaagagta agggcctcct tcctcccctg cctgcagctg gcttccacct 600






ggcacgtgcc tgctgcttcc tgagagcccg gcctctccct ccagtacttc tgtttgtgcc 660

cttctgcttc ccccattccc ttccacagct catagctcgt catctcggcc cttgtccaca 720

ctctccaagc acattacagg ggacctgatt gctacacgtt cagaatgcgt ttgctgtcat 780


cctgcttggc 


c t~ act ecu nnc 






acgcccgagc gcggagagca 


q a n 


cgagttagtt gtagtccggc ttgcggtggg gctgacttcc tgttggtttg agcccctttt 900

tgttttgccc tctgggtgtt ttctttggtc ccgcaggagg gtgggtggag caggtggact 960

ggagtttctc ttgagggcaa taaaagttgt catggtgtgt a 1001


<210> 83
<211> 2419
<212> DNA
<213> Homo sapien

<400> 83

gccatggatc ccaactgctc ctgcgccgcc ggtaagaggc tggggatgcc cagtgtagac 60

tgtagcacta gagaagcaat ttctgacccc tctttctttc tctggtcact caatttcagg 120

acaggagttg ctccttccca aagagttttg gggtatcttt ctctccattc taggttattc 180

ggagccccct ttttaccgtt aaggagatct gagttaatgg cttgctcaag ttcccaggaa 240

tcggttgtgg actgaggaac tcggccccgg gctcttagta cctgcccggg cggtcgggcg 300

gggccggggc gagttcggtt gctgcggcgg ggcctgcacg ttgactgtgg 360

gaaactcggt gagcgggctc cgcgcgccgg gctgggctcc gggaccgcgg aggctccccg 420

gcccatcgac gagggagaga ggcgagcggc gcggggaggc ccgggggccg gggaatctcg 480

gggcccgcag cctacctgcg tgtgagaggc aggggcgagg ggaaaccggg cggcgcctct 540

agaagctggt ctcgggggcg gggcgggggg cggtgaagcc ccacctggag gagcggccgg 600

gatgggcctc cgggacggtg tgccaggccg gggccaagtc ggaggcccct cgctctgggt 660

gggcgctggg gcccgcgagg gctactgtaa ggacccctgg cttctgagga tactgcgtct 720

agaactttct ccgtatgggg ccttgaggtg cttggtcgag acctgccttt gcgcttggtc 780

ccgaatcctg ccctctagga gtcgctcttg cgggcctcca gcccaccgga ggcgaagcgg 840

ccccgggcgg aaggccgctg gatcctcgag ggaggtgccg gtttctctcc gcgggcgccg 900

tggggacggt gggaggcggg ggcgtcggca gcgcttggac taggtgcggc cttgggcctg 960

cctggtagcg gggatttggg cccgcagagc gcccgcctct gcggctgagt tctgcctggc 1020

ggggaaggga gcgcccgatg ggtgccgagg cgtcctcctc ttggtgccct ggcactgctc 1080

ttcccgaaga acgcctttca gttaaacggg cgtcggaaat ctcgggcttc ctggggcagg 1140

gatcgtcggg agaggccgct ctggacgtgt tgacacacgt gctggagggg gcaggaaaca 1200

agctcacatc ttcctgtggg aaaccttcta gcaacaggat gagtctgcag tggactgcag 1260

ttgccacctt cctctatgcg gaggtctttg ttgtgttgct tctctgcatt cccttcattt 1320

ctcctaaaag atggcagaag attttcaagt cccggctggt ggagttgtta gtgtcctatg 1380

gcaacacctt ctttgtggtt ctcattgtca tccttgtgct gttggtcatc gatgccgtgc 1440

gcgaaattcg gaagtatgat gatgtgacgg aaaaggtgaa cctccagaac aatcccgggg 1500

ccatggagca cttccacatg aagcttttcc gtgcccagag gaatctctac attgctggct 1560

tttccttgct gctgtccttc ctgcttagac gcctggtgac tctcatttcg cagcaggcca 1620

cgctgctggc ctccaatgaa gcctttaaaa agcaggcgga gagtgctagt gaggcggcca 1680

agaagtacat ggaggagaat gaccagctca agaagggagc tgctgttgac ggaggcaagt 1740

tggatgtcgg gaatgctgag gtgaagttgg aggaagagaa caggagcctg aaggctgacc 1800

tgcagaagct aaaggacgag ctggccagca ctaagcaaaa actagagaaa gctgaaaacc 1860

aggttctggc catgcggaag cagtctgagg gcctcaccaa ggagtacgac cgcttgctgg 1920

aggagcacgc aaagctgcag gctgcagtag atggtcccat ggacaagaag gaagagtaag 1980

ggcctccttc ctcccctgcc tgcagctggc ttccacctgg cacgtgcctg ctgcttcctg 2040

agagcccggc ctctccctcc agtacttctg tttgtgccct tctgcttccc ccattccctt 2100

ccacagctca tagctcgtca tctcggccct tgtccacact ctccaagcac attacagggg 2160

acctgattgc tacacgttca gaatgcgttt gctgtcatcc tgcttggcct ggccaggcct 2220

ggcacagcct tggcttccac gcctgagcgt ggagagcacg agttagttgt agtccggctt 2280

gcggtggggc tgacttcctg ttggtttgag cccctttttg ttttgccctc tgggtgtttt 2340

ctttggtccc gcaggagggt gggtggagca ggtggactgg agtttctctt gagggcaata 2400

aaagttgtca tggtgtgta 2419



<210> 84
<211> 1487
<212> DNA
<213> Homo sapien

<400> 84

tctcgcccct cggcggggcc cctcccgcac cacagagacg tgggacggcc gcgcggacta 60

gagaagcggc cctcgccggc ccgacaaagc cccgccccgc cccgcccgcg tgctcgctcc 120

accccgccct gctcgcccgg agcccgaggc gcctcttcct ctcccgctcg gagcgccggc 180

cctccgctcc ggggcggggt gggaagcgga ggtgggcgga gcctccgggg cctcgcgagg 240

gctggtgggc ggtgcccggc gggcgcttcc ggtttccggc cgcggtatga ggggcggggc 300

cggggctgct gtgggagagt tcggttgctg cggcggggcc tgcacgttga ctgtgggaaa 360

ctcggaaaca agctcacatc ttcctgtggg aaaccttcta gcaacaggat gagtctgcag 420

tggactgcag ttgccacctt cctctatgcg gaggtctttg ttgtgttgct tctctgcatt 480

cccttcattt ctcctaaaag atggcagaag attttcaagt cccggctggt ggagttgtta 540

gtgtcctatg gcaacacctt ctttgtggtt ctcattgtca tccttgtgct gttggtcatc 600

gatgccgtgc gcgaaattcg gaagtatgat gatgtgacgg aaaaggtgaa cctccagaac 660

aatcccgggg ccatggagca cttccacatg aagcttttcc gtgcccagag gaatctctac 720


attgctggct 


fcttCCfcfcCTCt OTC tOtCCt fc C Of" act" t~ arrar' rrrr'hortl-rfan f- / -,i___f-i_i__^. 


780 


cagcaggcca cgctgctggc ctccaatgaa gcctttaaaa agcaggcgga gagtgctagt 840

gaggcggcca agaagtacat ggaggagaat gaccagctca agaagggagc tgctgttgac 900

ggaggcaagt tggatgtcgg gaatgctgag gtgaagttgg aggaagagaa caggagcctg 960

aaggctgacc tgcagaagct aaaggacgag ctggccagca ctaagcaaaa actagagaaa 1020

gctgaaaacc aggttctggc catgcggaag cagtctgagg gcctcaccaa ggagtacgac 1080

cgcttgctgg aggagcacgc aaagctgcag agcccggcct ctccctccag tacttctgtt 1140

tgtgcccttc tgcttccccc attcccttcc acagctcata gctcgtcatc tcggcccttg 1200

tccacactct ccaagcacat tacaggggac ctgattgcta cacgttcaga atgcgtttgc 1260

tgtcatcctg cttggcctgg ccaggcctgg cacagccttg gcttccacgc ctgagcgtgg 1320

agagcacgag ttagttgtag tccggcttgc ggtggggctg acttcctgtt ggtttgagcc 1380

cctttttgtt ttgccctctg ggtgttttct ttggtcccgc aggagggtgg gtggagcagg 1440

tggactggag tttctcttga gggcaataaa agttgtcatg gtgtgta 1487


<210> 85
<211> 1656
<212> DNA
<213> Homo sapien

<220>
<221> misc_feature
<222> (7)..(7)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (14)..(14)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (76)..(76)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (88)..(88)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (204)..(204)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (220)..(220)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (385)..(385)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (548)..(548)
<223> n=a, c, g or t

<400> 85

tgattcnttc cccnttttcc tacacccccc ccttcccccc gaccggagga gccgctcttt 60 

ccgcgcgggg cattcngggg cccgaggncg agcccgccgc cgccgccgtc gcccgaggga 120

agcgagaaga ggccgcgacc ggagagaaaa agcggagtcg ccaccggaga gaagtcgaca 180 

cccaagcagc agccgccgcc aganaggccc gcccaccagn ccgcccgtcc ccccgccccg 240 

tccacaaagc agcccgcctc cgcaaagggg tacgaacgaa gggacaaagt ctccaatgaa 300 

aaaagtgtag aagacagaaa ggaagtaaaa gaaaaaaaag aaaaaaccaa acaaacaatc 360 

agttgtcccg gaggaagtga aaaanaaaag cattaaaaag aaaaagaaca ttaacacaga 420 

attgaaccaa aagaatccaa gcaaaaaaga acggacagaa caaattaaag ggggaaacga 480

aaaggagaaa ccggccaggc aaggccaagg gaaacaaaag aaagggcaaa gccaaaaagg 540

ccaagggncg acatcaaaaa agggaaagac wgggaagawg awtcagatga agacatgtct 600 

aattttgatc gtttctctga gatgatgaac aacatgggtg gtgatgagga tgtagattta 660 

ccagaagtag atggagcaga tgatgattca caagacagtg atgatgaaaa aatgccagat 720 

ctggagtaag gaatattgtc atcacctgga ttttgagaaa gaaaaataac ttctctgcaa 780 

gatttcataa ttgagagaat tcctgagttg atagctctaa aggcagatat gctgtatttg 840 
cctactttaa cccatttttc aacctgtttg ttttttaaaa ggcttcacta agggttgata 900
tgtaccattg tatggggcaa ttttaagtca gctaaaggca ataaccttta ttgcatgaaa 960 

catttccccc agactttttc atgaaagctt gttgagggtt ccctaggggc acaattcaat 1020 

tgcagccagg tttgcggaat taaaatttaa aaaaccattt cttcaaacct taaaggttcc 1080 

ttcccttttc cttttccatt taaacccttt aggaattact tggaccttgg atttcgggag 1140 

aaggcatcct ctccaggcat cttaagggcc gcacaggaca cgcacactct cactttttta 1200 

ggaaatcttc attctggaac caccttgggt ttttccccaa caaggttggc tcctggggga 1260 

ccccaactgt tggggtgtgt tgacgacacc cctttttagc tgggcgtgaa caccgtgggg 1320 

gaaaagtctc ctcccgcgag gggtgctgat cccctttctc cggggtagtg ggggtttccc 1380 

ccgcttttga aacagtcttc ctcttgtctt cttcccacaa gagaccattc ttttccctgt 1440 

ctctgcccac tgtgttagtc ctcctcttgt gggggtccgc ggggggtcgg tgagcgtgcg 1500 

tgctcccggt ggggtgttcc ccgacggggg gcgtcttctt tacgggctcg tacacctcgc 1560 

tcgagggaaa tgtgtgcgcc cttaaacgtt tctggacgga ggggggcttg ggaggaaagc 1620 

ctttttctac acacgggggg gtctctctgt tgggac 1656 

<210> 86 
<211> 1448 
<212> DNA 
<213> Homo sapien 

<400> 86 

caaatggaca tgtctcttga ctggacatca ttaaactgaa ccggagccag cgaggcggct 60 

ggggcgggtg ccggggcctg cgtgccgtgt gccggctccc agttgcggsc cgcggyggtg 120 

gggcgcagcc gccgragtga atcgaggcgg cggcccatcc ggaaccggcc gkscatcgcc 180 

cgcggcgccg ggakkgcggc aggaaccgac cggcgcccta cagcaggcca aaacaacttc 240 

ccgacaagtg gcagcacgat cttttcgaca gtggcttcgg cggtggtgcc ggcgtggaga 300 

caggtgggaa accgtgcgtg gtgtccatct ggatwttgga gtctcagacg ccgatattca 360 

ggaactcttt gctgaatttg gaacgctgaa gaaggcggct gtgcactatg atcgctctgg 420 

tcgcagctta ggaacagcag acgtgcactt tgagcggaag gcagatgccc tgaaggccat 480

gaagcagtac aacggcgtcc ctctggatgg ccgccccatg aacattcagc ttgtcacgtc 540 

acagattgac gcacagcgga ggcctgcaca gagcgtaaac agaggtggca tgactagaaa 600 

ccgtggcgct ggaggtattg gtggtgatgg aggcacccgg agaggcaatg ctcgtatctt 660 

gtgtggaaga ggcagaggtg ccggcaggaa ttcaaagcag cagctttcgg cagaggagct 720 

ggatgcccag ctggacgcct ataatgcgag aatggacacc agttaaacag acyrgcrwat 780 
ctygtgygga acaggaccca ggcgtctcct cttgctccct ggtaaggcgg ggcgtggtgg 840

ctgtggctgt gcggccaatg atggatttgt ttctttgatg ttattaaaat atggatttaa 900

gaaactcatg taacaggttc ttctctattt atttgcttct aatatatttt ttatagcttc 960

tcgaaaccca cacctgtgtt ggttgacccc gagtgttcct tattgtcggg cgattcaaca 1020

cttgtacacc tgggtagtgc gagtgatggt tgcgtgagca caaactgggc gcgcgcgcgt 1080

atcgcccgac gcattaatcg cccacatcag tattataact acttaggaac atacggtcgt 1140

gtcatagaca catttgtggg gaacaattat aaccatatag gatctaggag acacaaccag 1200


99 t 999 c 9 c 9 


cgcgccagga 


gcfcagafcaga 


4— /t 4~ f» 

ddCCayatCC 


ccccggacag 


tagcggggac 


T O C f\ 

±Zo 0 


gaccaggcac ctgtgcacga cgagagtacg ccgaaggttc ttgtaattgg tgaaaagtac 1320

atacaagtta tggggcgcca ataggtaata gagtgggagt cgcagataga atatgtagaa 1380

ggcccaagcg agcgggatgc gaggcgcggg gtgtatgaat gcttcaccca agggtcgtag 1440

gtttagtg 1448



<210> 87
<211> 1594
<212> DNA
<213> Homo sapien

<400> 87

tggcgtgggt aagaccagca ttgctcgctc catcagcccg cgccctgaac acgagagtac 60

attccagctt cagcgtcggg ggcatgacat gacagtggct gagatcaagg gcacaacagg 120

caggacacta cgtgggcagc catgcccagg gaagatcatc caagtgtttg aagaagacca 180

agacggagaa ccccactgat cctcatcaga cgaggtggac aagatcaggc cagaggctac 240

acagggggac ccagtcagtc aggcaacatg catggagcat gcatggaccc aagagcaaga 300

atgccaaaca ttccatggac caacatacac atggacagtg ccacgtggac attgtcacaa 360

ggtgcatgtt catctgcacg gccaacgtca cggacaccat ccccgagccg ctgcgagacc 420

gtatggagat gatcaacgtg tcgggctaag tggsccagga gaagctggcc attgcggagc 480

rgctaactgg tgccccrggy mtcgcgcccw tgtgtggcwt tggatgagag caaggccaag 540

ctgtcatcgg acgtgctgac gctgctcatc aagcagtact gccgcgagag cggtgtccgc 600

aacctgcaga agcaagtgga gaaggtgtta cggaaatcgg cctacaagat tgtcagcggc 660

gaggccgagt ccgtggaggt gacgcccgag aacctgcagg acttcgtggg gaagcccgtg 720

ttcaccgtgg agcgcatgta tgacgtgaca ccgcccggcg tggtcatggg gctggcctgg 780

accgcaatgg gaggctccac gctgtttgtg gagacatccc tgagacggcc acaggacaag 840

gatgccaagg gtgacaagga tggcagcctg gaggtgacag gccagctggg ggaggtgatg 900

aaggagagcg cccgcatagc ctacaccttc gccagagcct tcctcatgca gcacgccccc 960

gccaatgact acctggtgac ctcacacatc cacctgcatg tgcccgaggg cgccaccccc 1020

aaggacggcc caagcgcagg ctgcaccatc gtcacggccc tgctgtccct ggccatgggc 1080 

aggcctgtcc ggcagaatct ggccatgact ggcgaagtct ccctcacggg caagatcctg 1140

cctgttggtg gcatcaagga gaagaccatt gcggccaagc gcgcaggggt gacgtgcatc 1200 

gtcctgccag ccgagaacaa gaaggacttc tacgacctgg cagccttcat caccgagggc 1260 

ctggaggtgc acttcgtgga acactaccgg gagatcttcg acatcgcctt cccggacgag 1320 

caggcagagg cgctggccgt ggaacggtga cggccacccc gggactgcag gcggcggatg 1380

tcaggccctg tctgggccag aactggatgt caggccctgt ctgggccaga actgagcgct 1440

gtggggagcg cgcccggacc tggcagtgga gccaccgagc gagcagctcg gtccagtgac 1500

ccagatccca gggacctcag tcggcttaat cagagtgtgg catagaagct atttaatgat 1560 

taaagtcatt tgcagtaaca cacacaacca caaa 1594

<210> 88 

<211> 991 

<212> DNA 

<213> Homo sapien 

<400> 88 

cgccttctgt gtccgggtgg acatgtctct ggcaattcta gaagaggtaa taatgtggga 60 

tcctagggtg tcccggccgt gcctcccatt ggggatgtgg ccacccagta ctttgctgac 120 

agggacatgt tctgtgctgg ccgagtacct gaggaggatc tgaagaggac aatgatggcc 180 

tgtggaggct caatccagac cagtgtgaat gctctgtcag cagatgtgct tggtcgatgc 240

cagttgtttg aagagaccca gattggaagc gagaggtaca atcttttgac tgctgcccca 300 

rgccaagaca tgcaccttca ttctccgtgg cggcaccgag cagtttatgg aggagacaga 360 

gcggtccctg catgatgcca tcatgatcgt cakgagggcc atcaagaatg attcagtggt 420 

rgctggtggc ggggccattr agatggaact ctccaagtac ctgcgggatt actcaaggac 480 

tattccagga aaacagcagc tgttgattgg ggcttatgcc aaggccttgg agattatccc 540 

acgccagctg tgtgacaatg ctggctttga tgccacaaac attctcaaca agctgcgggc 600 

tcggcatgcc caggggggta catggtatgg agtagacatc aacaacgagg acattgctga 660 

caactttgaa gctttcgtgt gggagccagc tatggtgcgg atcaatgcgc tgacagcagc 720 

ctctgaggct gcagtgcctg atcgtgtctt gtagatgaaa ccatcaagaa cctcdccgtc 780 

tcgactgtgg atgctcccac atgcatgcag gaccggggtc tgtggtctgt ggccgccccc 840

atctgagagg caccccaccc atcacatggc tggctggctg ctgggtgcac ttaccctcct 900

tggcttggtt acttcatttt acaaggaagg ggtagtaatt ggcccactct cttcttactg 960

gaggctattt aaataaaagg taagacttcc a 991



<210> 89 

<211> 944 

<212> DNA 

<213> Homo sapien 

<220>
<221> misc_feature
<222> (26)..(26)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (185)..(185)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (189)..(189)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (197)..(197)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (206)..(206)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (216)..(216)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (228)..(228)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (231)..(231)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (233)..(233)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (241)..(241)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (246)..(248)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (260)..(260)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (274)..(276)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (279)..(280)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (288)..(288)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (296)..(296)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (302)..(302)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (310)..(310)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (318)..(318)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (341)..(341)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (343)..(343)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (350)..(350)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (363)..(363)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (378)..(378)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (404)..(406)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (411)..(411)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (418)..(418)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (433)..(433)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (436)..(436)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (478)..(478)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (484)..(484)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (505)..(505)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (511)..(511)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (516)..(516)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (529)..(529)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (854)..(854)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (926)..(926)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (270)..(270)
<223> n=a, c, g or t



<400> 89 

gtcacccttt tcctgcacaa caacgnggtc ttccatgatc cccacatcgc tgactcctgg 60 

ccgcagctcc aggaccactt ctcagctgtg ragtcgtggg gccggaggag gctctgagcc 120 

caggcgaggc cagggacatg gccatggacc tgtgtcggca ggaccccgag tgtgagttct 180

acttnagcnt ggacgcngac gctgtnctca ccaacntgca gaccctgngt ntnctcattg 240

nggagnnnag gggggtgatn gctcccatgn tgtnnnggnn gggcaagntg tggtcnaact 300

tntggggcgn cctgagcncc gatgagtact acgcccgctc ngnggactan gtggagctgg 360

tgnagcggaa gcgatgtngg tgtgtggaat gttccatata tctnnnaggc ntatgtgntg 420

gggggtgata ccntgnggat ggagctgccc cagagggatg tgttctcggg cagtgacnca 480

gacncggaca tggccttctg ttagngcttt ngaganaagg gcatcttcnt ccatctgagc 540

tatcagcatg aatttggccg gctcctggcc acttccagat acgacacgga gcacctgcac 600

cccgacctct ggcagatctt cgacaacccc gtcgactgga aggagcagta catccacgag 660

aactacagcc gggccctgga agggaaggaa tcgtggagca gccatgcccg gacgtgtact 720

ggttcccact gctgtcagaa caaatgtgtg atgagctggt ggcagagatg gagcactacg 780

gccagtggtc aggcggccgg catgaggatt caaggctggc tggaggctac gagaatgtgc 840

ccaccgtgga catnccacat gaagcaggtg gggtacgagg accagtggct gcagctgctg 900

cggacgtatg tgggcccatg accganacct gtttccggtt taca 944


<210> 90 

<211> 973 

<212> DNA 

<213> Homo sapien 










<220> 

<221> misc_feature
<222> (1)..(1) 
<223> n=a, c, g or t 










<400> 90 
naagcatagt ctaggtacsm sgctkrtgmt ccccgccgct gcagccgtcg ccttcggagc 60

gaagggtacc gacccggcag aagctcggag ctctcggggt atcgaggagg caggcccgcg 120

ggcgcacggg cgagcgggcc gggagccgga gcggcggagg agccggcagc agcggcgcgg 180

cgggctccag gcgaggcggg ctccaggcga ggcggtcgac gctcctgaaa acttgcgcgc 240

gcgctcgcgc cactgcgccc ggagcgatga agatggtcgc gccctggact cggttctact 300

ccaacagctg ctgcttgtgc tgccatgtcc gcaccggcac catcctgctc ggcgtctggt 360

atctgatcat caatgctgtg gtactgttga ttttattgag tgccctggct gatccggatc 420

agtataactt ttcaagttct gaactgggag gtgactttga gttcatggat gatgccaaca 480

tgtgcattgc cattgcgatt tctcttctca tgatcctgat atgtgctatg gctacttacg 540

gagcgtacaa gcaacgcgca gcctggatca tcccattctt ctgttaccag atctttgact 600

ttgccctgaa catgttggtt gcaatcactg tgcttattta tccaaactcc attcaggaat 660

acatacggca actgcctcct aattttccct acagagatga tgtcatgtca gtgaatccta 720

cctgtttggt ccttattatt cttctgttta ttagcattat cttgactttt aagggttact 780 

tgattagctg tgtttggaac tgctaccgat acatcaatgg taggaactcc tctgatgtcc 840

tggtttatgt taccagcaat gacactacgg tgctgctacc cccgtatgat gatgccactg 900 

tgaatggtgc ttgccaagga gcaccggcac ttacgtgttg cctaagcctt caagtgggcg 960 

gacctgaggg cag 973 



<210> 91 

<211> 287 

<212> DNA 

<213> Homo sapien 

<400> 91 

aaggagcgga tgtctaatat atggcctggc tctagatgct ccgagcggcc gcagtgtgat 60 

ggattcgagc ggcgcccggg caggtacggt ctgcacccta catgggattc atagaggcca 120 

ccagaaacca ggccttttct tgtccagaag ttccaacgct tcagcagcga tagccaccac 180 

ttacagctcg tccccacaca gtgctgccac agcaggctga cgcaggtcct ccgcccgagc 240

acctccagcg ctgcacgtgc gcacttgcgt ctggactgca cgatccg 287 



<210> 92 

<211> 235 

<212> DNA 

<213> Homo sapien 

<400> 92 

gccagtgtga tggatgcgtg gtcgcggcga ggtacggtct gcaccctaca tgggattcat 60 

agaggccacc agaaaccagg ccttttcttg tccagaagtt ccaacgcttc agcagcgata 120 

gccaccactt acagctcgtc cccacacagt gctgccacag caggctgacg caggtcctcc 180 

gcccgagcac ctccagcgct gcacgtgcgc acttgcgtct ggactgcacg atccg 235 



<210> 93 

<211> 797 

<212> DNA 

<213> Homo sapien 

<220>
<221> misc_feature
<222> (772)..(772)
<223> n=a, c, g or t

<220>
<221> misc_feature
<222> (785)..(786)
<223> n=a, c, g or t






<220>
<221> misc_feature
<222> (796)..(796)
<223> n=a, c, g or t

<400> 93 

gccgacgaga cctcagaaaa agaacagcaa gaagcgattg aacacattga tgaagtacaa 60 

aatgaaatag acagacttaa tgaacaagcc agtgaggaga ttttgaaagt agaacagaaa 120 

tataacaaac tccgccaacc atttkttcag aagaggtcag aattgatcgc caaaatccca 180 

acatttttgg ggtaacaaca tttgtcaacc atccacaaga tgtctgcact gctctgcggg 240 

aggaagatga agaggcactg cattatttga ccagagttga agtgacagaa tttgaagata 300

ttaaatcagg ttacagaata gagttttdat tttgatggaa atccttactt tgaaaataaa 360 

gttctctccd aagaatttca tctgaatgag agtggtgatc catcttcgaa gtccaccgaa 420 

atcaaatgga aatctggaaa ggatttgacg aaacgttcga gtcaaacgca gaataaagcc 480 

agcaggaaga ggcagcatga ggaaccagag agcttcttta cctggtttac tgaccattct 540

gatgcaggtg ctgatgagtt aggagaggtc atcaaagatg atatttggcc aaacccatta 600 

cagtactact tggttcccga tatggatgat gaagaaggag aaggagaaga agatgatgat 660 

gatgatgarg aggaggaagg attagaagat attgcgaaga aggggatgag gatgaaggtk 720 

argaagatka agatkatkat gwartgggag gaaggacagg aggattaaag anaaaatttc 780 

tttannaacc cccaanc 797 

<210> 94 

<211> 2068 

<212> DNA 

<213> Homo sapien 

<400> 94 

agtccgcccc gccccctccc cgtccccgcc gctgcagccg tcgccttcgg agcgaagggt 60 

accgacccgg cagaagctcg gagctctcgg ggtatcgagg aggcaggccc gcgggcgcac 120 

gggcgagcgg gccgggagcc ggagcggcgg aggagccggc agcagcggcg cggcgggctc 180 

caggcgaggc ggtcgacgct cctgaaaact tgcgcgcgcg ctcgcgccac tgcgcccgga 240 

gcgatgaaga tggtcgcgcc ctggacgcgg ttctactcca acagctgctg cttgtgctgc 300 

catgtccgca ccggcaccat cctgctcggc gtctggtatc tgatcatcaa tgctgtggta 360 

ctgttgattt tattgagtgc cctggctgat ccggatcagt ataacttttc aagttctgaa 420 

ctgggaggtg actttgagtt catggatgat gccaaaattc tttttaatct ttcggcagac 480

atgtgcattg ccattgcgat ttctcttctc atgatcctga tatgtgctat ggctacttac 540

ggagcgtaca agcaacgcgc agcctggatc atcccattct tctgttacca gatctttgac 600

tttgccctga acatgttggt tgcaatcact gtgcttattt atccaaactc cattcaggaa 660 

tacatacggc aactgcctcc taattttccc tacagagatg atgtcatgtc agtgaatcct 720 

acctgtttgg tccttattat tcttctgttt attagcatta tcttgacttt taagggttac 780 

ttgattagct gtgtttggaa ctgctaccga tacatcaatg gtaggaactc ctctgatgtc 840 

ctggtttatg ttaccagcaa tgacactacg gtgctgctac ccccgtatga tgatgccact 900 

gtgaatggtg ctgccaagga gccaccgcca ccttacgtgt ctgcctaagc cttcaagtgg 960 

gcggagctga gggcagcagc ttgactttgc agacatctga gcaatagttc tgttatttca 1020 

cttttgccat gagcctctct gagcttgttt gttgctgaaa tgctactttt taaaatttag 1080 

atgttagatt gaaaactgta gttttcaaca tatgctttgc tagaacactg tgatagatta 1140 

actgtagaat tcttcctgta cgattgggga tataacgggc ttcactaacc ttccctaggc 1200 

attgaaactt cccccaaatc tgatggacct agaagtctgc ttttgtacct gctgggcccc 1260 

aaagttgggc atttttctct ctgttccctc tcttttgaaa atgtaaaata aaaccaaaaa 1320 

tagacaactt tttcttcagc cattccagca tagagaacaa aaccttatgg aaacaggaat 1380

gtcaattgtg taatcattgt tctaattagg taaatagaag tccttatgta tgtgttacaa 1440

gaatttcccc cacaacatcc tttatgactg aagttcaatg acagtttgtg tttggtggta 1500

aaggattttc tccatggcct gaattaagac cattagaaag caccaggccg tgggagcagt 1560

gaccatctgc tgactgttct tgtggatctt gtgtccaggg acatggggtg acatgcctcg 1620

tatgtgttag agggtggaat ggatgtgttt ggcgctgcat gggatctggt gcccctcttc 1680 

tcctggattc acatccccac ccagggcccg cttttactaa gtgttctgcc ctagattggt 1740 

tcaaggaggt catccaactg actttatcaa gtggaattgg gatatatttg atatacttct 1800 

gcctaacaac atggaaaagg gttttctttt ccctgcaagc tacatcctac tgctttgaac 1860 

ttccaagtat gtctagtcac cttttaaaat gtaaacattt tcagaaaaat gaggattgcc 1920 

ttccttgtat gcgcttttta ccttgactac ctgaattgca agggattttt atatattcat 1980 

atgttacaaa gtcagcaact ctcctgttgg ttcattattg aatgtgctgt aaattaagtc 2040 

gtttgcaatt aaaacaaggt ttgcccca 2068 

<210> 95 

<211> 211 

<212> PRT 

<213> Homo sapien 

<400> 95

Met Ala Ser Gly Gln Gly Pro Gly Pro Pro Arg Gln Glu Cys Gly Glu
1 5 10 15

Pro Ala Leu Pro Ser Ala Ser Glu Glu Gln Val Ala Gln Asp Thr Glu
20 25 30

Glu Val Phe Arg Ser Tyr Val Phe Tyr Arg His Gln Gln Glu Gln Glu
35 40 45

Ala Glu Gly Val Ala Ala Pro Ala Asp Pro Glu Met Val Thr Leu Pro
50 55 60

Leu Gln Pro Ser Ser Thr Met Gly Gln Val Gly Arg Gln Leu Ala Ile
65 70 75 80

Ile Gly Asp Asp Ile Asn Arg Arg Tyr Asp Ser Glu Phe Gln Thr Met
85 90 95

Leu Gln His Leu Gln Pro Thr Ala Glu Asn Ala Tyr Glu Tyr Phe Thr
100 105 110

Lys Ile Ala Thr Ser Leu Phe Glu Ser Gly Ile Asn Trp Gly Arg Val
115 120 125

Val Ala Leu Leu Gly Phe Gly Tyr Arg Leu Ala Leu His Val Tyr Gln
130 135 140

His Gly Leu Thr Gly Phe Leu Gly Gln Val Thr Arg Phe Val Val Asp
145 150 155 160

Phe Met Leu His His Cys Ile Ala Arg Trp Ile Ala Gln Arg Gly Gly
165 170 175

Trp Val Ser Ile Gln Gly Leu Gln Cys Pro Pro Cys Cys Trp Gly Cys
180 185 190

Pro Ser Pro Arg Pro His Pro Phe Leu Gly Phe Leu Tyr Leu Leu Leu
195 200 205

Ala Ser Leu
210



<210> 96 

<211> 225 

<212> PRT 

<213> Homo sapien

<400> 96

Glu Trp His Gln Leu Gly Pro Cys Gly Gly Ser Ser Gly Leu Arg Leu
1 5 10 15

Pro Ser Gly Pro Thr Arg Leu Pro Ala Trp Pro Asp Trp Leu Pro Arg
20 25 30

Pro Gly Asp Pro Leu Arg Gly Arg Leu His Ala Ala Ser Leu His Cys
35 40 45

Pro Val Asp Cys Thr Glu Gly Trp Leu Gly Glu Tyr Pro Arg Thr Ala
50 55 60

Met Ser Ser Leu Leu Leu Gly Leu Pro Leu Ser Gln Ala Pro Pro Leu
65 70 75 80

Pro Arg Val Pro Ile Ser Ser Ala Ser Leu Ser Val Thr Thr Leu Phe
85 90 95

Leu Pro Ala Gly Gly Ser Pro Glu Leu Gly Gln Trp Ser His Pro Glu
100 105 110

Arg Ala Gly Gly Ser Gly Cys Gly Ser Val Gly Pro Val Cys Gly Thr
115 120 125

Lys Ile Leu Gln Ile Met Thr Pro Lys Gly Ala Leu Trp Gly Pro Gly
130 135 140

Ser Asp Pro Cys Leu Asp Leu Ser Glu Val Phe Ala Phe Ser Ala Pro
145 150 155 160

Leu Gln Gly Ser Pro Leu Lys Ser Thr Glu Ala Leu Ala Ser Val His
165 170 175

Ser Ser Phe Gly Gly Pro Leu Arg Gly Gly Gln Ser Gly Cys Arg Gly
180 185 190

Thr Ser Thr Leu His Gly Ala Ser Gly Pro Ser Leu Trp Ala Gln Gly
195 200 205

Leu Trp Pro Ser Pro Pro Ser Ala Leu Trp Asp Leu Leu Ser Pro Val
210 215 220

Cys
225

<210> 97

<211> 284 

<212> PRT 

<213> Homo sapien 

<400> 97 

Met Ala Pro Leu His Ser Ser Leu Gly Asn Lys Arg Asn Ser Ile Ser
1 5 10 15

Lys Lys Lys Lys Ile Tyr Ser Cys Phe Leu Asn Leu Ala Glu Arg Ile
20 25 30

Arg Ile Pro Glu Pro Trp Ile Thr Pro Pro Asp Leu Gln Glu Lys Ile
35 40 45

His Ile Phe Ala Gln Lys Cys Leu Phe Leu Thr Glu Ser Leu Lys Gln
50 55 60

Phe Thr Glu Lys Met Gln Ser Asp Met Glu Lys Ile Gln Glu Leu Arg
65 70 75 80

Glu Ala Gln Leu Tyr Ser Val Asp Val Thr Leu Asp Pro Asp Thr Ala
85 90 95

Tyr Pro Ser Leu Ile Leu Ser Asp Asn Leu Arg Gln Val Arg Tyr Ser
100 105 110

Tyr Leu Gln Gln Asp Leu Pro Asp Asn Pro Glu Arg Phe Asn Leu Phe
115 120 125

Pro Cys Val Leu Gly Ser Pro Cys Phe Ile Ala Gly Arg His Tyr Trp
130 135 140

Glu Val Glu Val Gly Asp Lys Ala Lys Trp Thr Ile Gly Val Cys Glu
145 150 155 160

Asp Ser Val Cys Arg Lys Gly Gly Val Thr Ser Ala Pro Gln Asn Gly
165 170 175

Phe Trp Ala Val Ser Leu Trp Tyr Gly Lys Glu Tyr Trp Ala Leu Thr
180 185 190

Ser Pro Met Thr Ala Leu Pro Leu Arg Thr Pro Leu Gln Arg Val Gly
195 200 205

Ile Phe Leu Asp Tyr Asp Ala Gly Glu Val Ser Phe Tyr Asn Val Thr
210 215 220

Glu Arg Cys His Thr Phe Thr Phe Ser His Ala Thr Phe Cys Gly Pro
225 230 235 240

Val Arg Pro Tyr Phe Ser Leu Ser Tyr Ser Gly Gly Lys Ser Ala Ala
245 250 255

Pro Leu Ile Ile Cys Pro Met Ser Gly Ile Asp Gly Phe Ser Gly His
260 265 270

Val Gly Asn His Gly His Ser Met Glu Thr Ser Pro
275 280



<210> 98 

<211> 321 

<212> PRT 

<213> Homo sapien 

<400> 98 

Lys His Lys Ile Ser Trp Val Trp Trp Arg Met Pro Val Ile Pro Ala
1 5 10 15

Thr Gln Glu Ala Glu Ala Gly Glu Ser Leu Glu Pro Gly Arg Gln Arg
20 25 30

Leu Arg Trp Ala Glu Met Ala Pro Leu His Ser Ser Leu Gly Asn Lys
35 40 45

Arg Asn Ser Ile Ser Lys Lys Lys Lys Ile Tyr Ser Cys Phe Leu Asn
50 55 60

Leu Ala Glu Arg Ile Arg Ile Pro Glu Pro Trp Ile Thr Pro Pro Asp
65 70 75 80

Leu Gln Glu Lys Ile His Ile Phe Ala Gln Lys Cys Leu Phe Leu Thr
85 90 95

Glu Ser Leu Lys Gln Phe Thr Glu Lys Met Gln Ser Asp Met Glu Lys
100 105 110

Ile Gln Glu Leu Arg Glu Ala Gln Leu Tyr Ser Val Asp Val Thr Leu
115 120 125

Asp Pro Asp Thr Ala Tyr Pro Ser Leu Ile Leu Ser Asp Asn Leu Arg
130 135 140

Gln Val Arg Tyr Ser Tyr Leu Gln Gln Asp Leu Pro Asp Asn Pro Glu
145 150 155 160

Arg Phe Asn Leu Phe Pro Cys Val Leu Gly Ser Pro Cys Phe Ile Ala
165 170 175

Gly Arg His Tyr Trp Glu Val Glu Val Gly Asp Lys Ala Lys Trp Thr
180 185 190

Ile Gly Val Cys Glu Asp Ser Val Cys Arg Lys Gly Gly Val Thr Ser
195 200 205

Ala Pro Gln Asn Gly Phe Trp Ala Val Ser Leu Trp Tyr Gly Lys Glu
210 215 220

Tyr Trp Ala Leu Thr Ser Pro Met Thr Ala Leu Pro Leu Arg Thr Pro
225 230 235 240

Leu Gln Arg Val Gly Ile Phe Leu Asp Tyr Asp Ala Gly Glu Val Ser
245 250 255

Phe Tyr Asn Val Thr Glu Arg Cys His Thr Phe Thr Phe Ser His Ala
260 265 270

Thr Phe Cys Gly Pro Val Arg Pro Tyr Phe Ser Leu Ser Tyr Ser Gly
275 280 285

Gly Lys Ser Ala Ala Pro Leu Ile Ile Cys Pro Met Ser Gly Ile Asp
290 295 300

Gly Phe Ser Gly His Val Gly Asn His Gly His Ser Met Glu Thr Ser
305 310 315 320

Pro



<210> 99 

<211> 50 

<212> PRT 

<213> Homo sapien 

<400> 99 

Thr Ile Tyr Gly Pro Lys Val Ala Lys Ile Leu Arg Ala Ser Val Glu
1 5 10 15

Leu Leu Asp Glu Met Lys Phe Ser Leu Glu Lys Leu His Gln Gly Ile
20 25 30

Thr Val Ser Asp Pro Pro Phe Asp Thr Gln Pro Arg Pro Asp Asp Ser
35 40 45

Phe Ser
50



<210> 100 

<211> 121 

<212> PRT 

<213> Homo sapien 

<400> 100 

Ser Ser Arg Trp Arg Ser Cys Thr Lys Ala Ser Gln Ser Gln Thr Leu
1 5 10 15

Pro Leu Thr Pro Ser Pro Gly Pro Met Thr Ala Phe Pro Glu Asp Pro
20 25 30

Gly His Ala Ala Val Pro Pro His Gly Gln Met Asp Thr Gln Ser Leu
35 40 45

Gly Gly His Cys Trp His Gly Val Ser Ala Arg His Leu Pro Pro Ala
50 55 60

Pro Pro Asp Gly Pro Thr Arg Gly Cys Ala Asp Val Gly Thr Thr Glu
65 70 75 80

Pro Arg Cys Thr Leu Asp Gln Gly Ala Gly Pro Ala Ser Gly Arg Pro
85 90 95

Pro Thr Asn Leu Phe Cys Pro Ala Glu Val Val Gly Gly Ala Ser Trp
100 105 110

Gly Ala Arg Phe Pro Gln Leu Trp Val
115 120


<210> 101

<211> 386

<212> PRT

<213> Homo sapien

<400> 101



Met Phe Asp Thr Thr Pro His Ser Gly Arg Ser Thr Pro Ser Ser Ser
1 5 10 15

Pro Ser Leu Arg Lys Arg Leu Gln Leu Leu Pro Pro Ser Arg Pro Pro
20 25 30

Pro Glu Pro Glu Pro Gly Thr Met Val Glu Lys Gly Ser Asp Ser Ser
35 40 45

Ser Glu Lys Gly Gly Val Pro Gly Thr Pro Ser Thr Gln Ser Leu Gly
50 55 60

Ser Arg Asn Phe Ile Arg Asn Ser Lys Lys Met Gln Ser Trp Tyr Ser
65 70 75 80

Met Leu Ser Pro Thr Tyr Lys Gln Arg Asn Glu Asp Phe Arg Lys Leu
85 90 95

Phe Ser Lys Leu Pro Glu Ala Glu Arg Leu Ile Val Asp Tyr Ser Cys
100 105 110

Ala Leu Gln Arg Glu Ile Leu Leu Gln Gly Arg Leu Tyr Leu Ser Glu
115 120 125

Asn Trp Ile Cys Phe Tyr Ser Asn Ile Phe Arg Trp Glu Thr Thr Ile
130 135 140

Ser Ile Gln Leu Lys Glu Val Thr Cys Leu Lys Lys Glu Lys Thr Ala
145 150 155 160

Lys Leu Ile Pro Asn Ala Ile Gln Ile Cys Thr Glu Ser Glu Lys His
165 170 175

Phe Phe Thr Ser Phe Gly Ala Arg Asp Arg Cys Phe Leu Leu Ile Phe
180 185 190

Arg Leu Trp Gln Asn Ala Leu Leu Glu Lys Thr Leu Ser Pro Arg Glu
195 200 205

Leu Trp His Leu Val His Gln Cys Tyr Gly Ser Glu Leu Gly Leu Thr
210 215 220

Ser Glu Asp Glu Asp Tyr Val Ser Pro Leu Gln Leu Asn Gly Leu Gly
225 230 235 240

Thr Pro Lys Glu Val Gly Asp Val Ile Ala Leu Ser Asp Ile Thr Ser
245 250 255

Ser Gly Ala Ala Asp Arg Ser Gln Glu Pro Ser Pro Val Gly Ser Arg
260 265 270

Arg Gly His Val Thr Pro Asn Leu Ser Arg Ala Ser Ser Asp Ala Asp
275 280 285

His Gly Ala Glu Glu Asp Lys Glu Glu Gln Val Asp Ser Gln Pro Asp
290 295 300

Ala Ser Ser Ser Gln Thr Val Thr Pro Val Ala Glu Pro Pro Ser Thr
305 310 315 320

Glu Pro Thr Gln Pro Asp Gly Pro Thr Thr Leu Gly Pro Leu Asp Leu
325 330 335

Leu Pro Ser Glu Glu Leu Leu Thr Asp Thr Ser Asn Ser Ser Ser Ser
340 345 350

Thr Gly Glu Glu Gly Glu Ala Gly Gly Pro Asn Ser Phe Ala Ser Gly
355 360 365

Thr Cys Lys Pro Arg Ser Val Leu Ser Lys Arg Gly Trp Ile Arg Pro
370 375 380

Gln His
385



<210> 102 

<211> 190 

<212> PRT 

<213> Homo sapien 

<400> 102 

Met Phe Gly Phe Asn Lys Val Ser Leu Phe Pro Ser Leu Ser Leu Cys
1 5 10 15

Pro Ser Ala Phe Pro Ser Leu Ser Ser Pro Ser Leu Pro Gly Phe Ser
20 25 30

Leu Leu Leu Gly Leu Cys Gly Pro Ser Pro Pro Ser Leu Pro Gly Trp
35 40 45

Gly Arg Arg Arg Asp Ala Ala Gly Arg Gly Lys Glu Gly Gly Val Ala
50 55 60

Asp Ala Ala Ala Ala Ser Gly Gly Ser Gly Leu Leu Cys Gly Arg Leu
65 70 75 80

Gly Arg Val Gly Ala Ala Ala Ala Glu Gly Gln Ala Gly Ala Leu Arg
85 90 95

Gly Arg Pro Ala Gln Pro Ser Pro Ala Gln Pro Ser Pro Ala Leu Pro
100 105 110

Cys Pro Ala Leu Pro Cys Ala Arg Gly Ala Pro Thr Ala Pro His Pro
115 120 125

Cys Ser Thr Pro His Pro Thr Leu Ala Gly Ala Arg Gln Ala Ala Pro
130 135 140

His Arg Ser Gly Asn Gly Cys Ser Ser Cys Pro Gln Ala Gly Pro His
145 150 155 160

Leu Ser Gln Asn Gln Ala Pro Trp Trp Arg Arg Asp Gln Ile Ala Pro
165 170 175

Gln Arg Arg Val Gly Cys Leu Gly Pro Pro Ala Pro Arg Ala
180 185 190



<210> 103 

<211> 442 

<212> PRT 

<213> Homo sapien 

<400> 103 

Arg Arg Gly Gly Leu Arg Arg Leu Arg Pro Phe Val Arg Ala Val Gly
1 5 10 15

Ser Gly Gly Gly Gly Gly Arg Gly Arg Pro Gly Arg Cys Pro Ala Gly
20 25 30

Thr Pro Ser Ala Ala Gln Pro Arg Ala Ala Gln Pro Cys Pro Ala Leu
35 40 45

Pro Cys Pro Ala Leu Arg Pro Gly Arg Ala His Arg Ala Ala Ser Met
50 55 60

Phe Asp Thr Thr Pro His Ser Gly Arg Ser Thr Pro Ser Ser Ser Pro
65 70 75 80

Ser Leu Arg Lys Arg Leu Gln Leu Leu Pro Pro Ser Arg Pro Pro Pro
85 90 95

Glu Pro Glu Pro Gly Thr Met Val Glu Lys Gly Ser Asp Ser Ser Ser
100 105 110

Glu Lys Gly Gly Val Pro Gly Thr Pro Ser Thr Gln Ser Leu Gly Ser
115 120 125

Arg Asn Phe Ile Arg Asn Ser Lys Lys Met Gln Ser Trp Tyr Ser Met
130 135 140

Leu Ser Pro Thr Tyr Lys Gln Arg Asn Glu Asp Phe Arg Lys Leu Phe
145 150 155 160

Ser Lys Leu Pro Glu Ala Glu Arg Leu Ile Val Asp Tyr Ser Cys Ala
165 170 175

Leu Gln Arg Glu Ile Leu Leu Gln Gly Arg Leu Tyr Leu Ser Glu Asn
180 185 190

Trp Ile Cys Phe Tyr Ser Asn Ile Phe Arg Trp Glu Thr Thr Ile Ser
195 200 205

Ile Gln Leu Lys Glu Val Thr Cys Leu Lys Lys Glu Lys Thr Ala Lys
210 215 220

Leu Ile Pro Asn Ala Ile Gln Ile Cys Thr Glu Ser Glu Lys His Phe
225 230 235 240

Phe Thr Ser Phe Gly Ala Arg Asp Arg Cys Phe Leu Leu Ile Phe Arg
245 250 255

Leu Trp Gln Asn Ala Leu Leu Glu Lys Thr Leu Ser Pro Arg Glu Leu
260 265 270

Trp His Leu Val His Gln Cys Tyr Gly Ser Glu Leu Gly Leu Thr Ser
275 280 285

Glu Asp Glu Asp Tyr Val Ser Pro Leu Gln Leu Asn Gly Leu Gly Thr
290 295 300

Pro Lys Glu Val Gly Asp Val Ile Ala Leu Ser Asp Ile Thr Ser Ser
305 310 315 320

Gly Ala Ala Asp Arg Ser Gln Glu Pro Ser Pro Val Gly Ser Arg Arg
325 330 335

Gly His Val Thr Pro Asn Leu Ser Arg Ala Ser Ser Asp Ala Asp His
340 345 350






Gly Ala Glu Glu Asp Lys Glu Glu Gln Val Asp Ser Gln Pro Asp Ala
355 360 365

Ser Ser Ser Gln Thr Val Thr Pro Val Ala Glu Pro Pro Ser Thr Glu
370 375 380

Pro Thr Gln Pro Asp Gly Pro Thr Thr Leu Gly Pro Leu Asp Leu Leu
385 390 395 400

Pro Ser Glu Glu Leu Leu Thr Asp Thr Ser Asn Ser Ser Ser Ser Thr
405 410 415

Gly Glu Glu Gly Leu Gly Cys Pro Ala Ser Arg Pro Leu Arg Pro Pro
420 425 430

Pro His Gln Leu Cys Leu Pro Cys Gly Arg
435 440



<210> 104 

<211> 99 

<212> PRT 

<213> Homo sapien 

<400> 104 

Met Glu Asp Pro Ser Val Arg Cys Ser Phe Gln Leu Thr Ser Gly Arg
1 5 10 15

Arg Thr Ser Ala Met Lys Val Thr Gly Ile Phe Leu Leu Ser Ala Leu
20 25 30

Ala Leu Leu Ser Leu Ser Gly Asn Thr Gly Ala Asp Ser Leu Gly Arg
35 40 45

Glu Ala Lys Cys Tyr Asn Glu Leu Asn Gly Cys Thr Lys Ile Tyr Asp
50 55 60

Pro Val Cys Gly Thr Asp Gly Asn Thr Tyr Pro Asn Glu Cys Val Leu
65 70 75 80

Cys Phe Glu Asn Arg Lys Arg Gln Thr Ser Ile Leu Ile Gln Lys Ser
85 90 95

Gly Pro Cys

<210> 105

<211> 106 

<212> PRT 

<213> Homo sapien 

<400> 105 

Ala Leu Asp Val Lys Val Ser His Glu Gly Gly Arg Ser Val Leu Pro
1 5 10 15

Gln Ser Lys Tyr Ser Val Arg His His Ile Pro Tyr Pro Ser Asn Gly
20 25 30

Glu Leu His Ser Gln Tyr His Gly Tyr Tyr Val Lys Leu Asn Ala Pro
35 40 45

Gln His Pro Pro Val Asp Val Glu Asp Gly Asp Gly Ser Ser Gln Ser
50 55 60

Ser Ser Ala Leu Val His Lys Pro Ser Ala Asn Lys Trp Ser Pro Ser
65 70 75 80

Lys Ser Val Thr Lys Pro Val Ala Lys Glu Ser Lys Ala Glu Pro Lys
85 90 95

Ala Lys Lys Ser Glu Leu Ala Ile Pro Lys
100 105



<210> 106 

<211> 93 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE

<222> (93)..(93)

<223> X=any amino acid 



<400> 106 

Ala Leu Asp Val Lys Val Ser His Glu Gly Gly Arg Ser Val Leu Pro
1 5 10 15

Gln Ser Lys Tyr Ser Val Arg His His Ile Pro Tyr Pro Ser Asn Gly
20 25 30

Glu Leu His Ser Gln Tyr His Gly Tyr Tyr Val Lys Leu Asn Ala Pro
35 40 45

Gln His Pro Pro Val Asp Val Glu Asp Gly Asp Gly Ser Ser Gln Ser
50 55 60

Ser Ser Ala Leu Val His Lys Pro Ser Ala Asn Lys Trp Ser Pro Ser
65 70 75 80

Lys Ser Val Thr Lys Pro Val Ala Lys Glu Ser Lys Xaa
85 90



<210> 107 

<211> 100 

<212> PRT 

<213> Homo sapien 

<400> 107 

Met Tyr Met Tyr Ile Gln Thr Thr Thr Leu Thr Val Ser Met Ser Leu
1 5 10 15

Ser Ala Ser Val Ser Leu Gly Met Leu Tyr Met Pro Lys Val Tyr Ile
20 25 30

Ile Ile Phe His Pro Glu Gln Asn Val Gln Lys Arg Lys Arg Ser Phe
35 40 45

Lys Ala Val Val Thr Ala Ala Thr Met Gln Ser Lys Leu Ile Gln Lys
50 55 60

Gly Asn Asp Arg Pro Asn Gly Glu Val Lys Ser Glu Leu Cys Glu Ser
65 70 75 80

Leu Glu Thr Asn Thr Ser Ser Thr Lys Thr Thr Tyr Ile Ser Tyr Ser
85 90 95

Asn His Ser Ile
100



<210> 108 

<211> 558 

<212> PRT 

<213> Homo sapien 

<400> 108 

Trp Lys Phe Thr Val Thr Phe Lys Ser Arg Lys Leu Ala Asn Ala His
1 5 10 15

Arg Ala Phe Ser Pro Trp Ala Leu Met Val Ala Ser Arg Arg Cys Ser
20 25 30

Leu Arg Trp Cys Pro Phe Cys Cys Val Ala Arg Ile Asn Phe Gly Ser
35 40 45

Trp Ile Ala Ile Pro Pro Val Glu Lys Met Val Cys Glu Gly Lys Arg
50 55 60

Ser Ala Ser Cys Pro Cys Phe Phe Leu Leu Thr Ala Lys Phe Tyr Trp
65 70 75 80

Ile Leu Thr Met Met Gln Arg Thr His Ser Gln Glu Tyr Ala His Ser
85 90 95

Ile Arg Val Asp Gly Asp Ile Ile Leu Gly Gly Leu Phe Pro Val His
100 105 110

Ala Lys Gly Glu Arg Gly Val Pro Cys Gly Glu Leu Lys Lys Glu Lys
115 120 125

Gly Ile His Arg Leu Glu Ala Met Leu Tyr Ala Ile Asp Gln Ile Asn
130 135 140

Lys Asp Pro Asp Leu Leu Ser Asn Ile Thr Leu Gly Val Arg Ile Leu
145 150 155 160

Asp Thr Cys Ser Arg Asp Thr Tyr Ala Leu Glu Gln Ser Leu Thr Phe
165 170 175

Val Gln Ala Leu Ile Glu Lys Asp Ala Ser Asp Val Lys Cys Ala Asn
180 185 190

Gly Asp Pro Pro Ile Phe Thr Lys Pro Asp Lys Ile Ser Gly Val Ile
195 200 205

Gly Ala Ala Ala Ser Ser Val Ser Ile Met Val Ala Asn Ile Leu Arg
210 215 220

Leu Phe Lys Ile Pro Gln Ile Ser Tyr Ala Ser Thr Ala Pro Glu Leu
225 230 235 240

Ser Asp Asn Thr Arg Tyr Asp Phe Phe Ser Arg Val Val Pro Pro Asp
245 250 255

Ser Tyr Gln Ala Gln Ala Met Val Asp Ile Val Thr Ala Leu Gly Trp
260 265 270

Asn Tyr Val Ser Thr Leu Ala Ser Glu Gly Asn Tyr Gly Glu Ser Gly
275 280 285

Val Glu Ala Phe Thr Gln Ile Ser Arg Glu Ile Gly Gly Val Cys Ile
290 295 300

Ala Gln Ser Gln Lys Ile Pro Arg Glu Pro Arg Pro Gly Glu Phe Glu
305 310 315 320

Lys Ile Ile Lys Arg Leu Leu Glu Thr Pro Asn Ala Arg Ala Val Ile
325 330 335

Met Phe Ala Asn Glu Asp Asp Ile Arg Arg Ile Leu Glu Ala Ala Lys
340 345 350

Lys Leu Asn Gln Ser Gly His Phe Leu Trp Ile Gly Ser Asp Ser Trp
355 360 365

Gly Ser Lys Ile Ala Pro Val Tyr Gln Gln Glu Glu Ile Ala Glu Gly
370 375 380

Ala Val Thr Ile Leu Pro Lys Arg Ala Ser Ile Asp Gly Phe Asp Arg
385 390 395 400

Tyr Phe Arg Ser Arg Thr Leu Ala Asn Asn Arg Arg Asn Val Trp Phe
405 410 415

Ala Glu Phe Trp Glu Glu Asn Phe Gly Cys Lys Leu Gly Ser His Gly
420 425 430

Lys Arg Asn Ser His Ile Lys Lys Cys Thr Gly Leu Glu Arg Ile Ala
435 440 445

Arg Asp Ser Ser Tyr Glu Gln Glu Gly Lys Val Gln Phe Val Ile Asp
450 455 460

Ala Val Tyr Ser Met Ala Tyr Ala Leu His Asn Met His Lys Asp Leu
465 470 475 480

Cys Pro Gly Tyr Ile Gly Leu Cys Pro Arg Met Ser Thr Ile Asp Gly
485 490 495

Lys Glu Leu Leu Gly Tyr Ile Arg Ala Val Asn Phe Asn Gly Cys Arg
500 505 510

Arg Gly Ile Gln Met Ser Leu Pro Trp Pro Thr Leu Phe Thr Pro Ser
515 520 525

Phe Ser Ser Ser Trp Ala Val Leu Ala Leu Leu Ser Leu Leu Met Lys
530 535 540

Asp Val Met Ile Ser Ser Ser Ile Lys
545 550 555

Met Val Cys Glu Gly Lys Arg Ser Ala Ser Cys Pro Cys Phe Phe Leu
1 5 io 15 

Leu Thr Ala Lys Phe Tyr Trp lie Leu Thr Met Met Gin Arg Thr His 
20 25 30 

Ser Gin Glu Tyr Ala His Ser lie Arg Val Asp Gly Asp lie He Leu 
35 40 45 

Gly Gly Leu Phe Pro Val His Ala Lys Gly Glu Arg Gly Val Pro Cys 
50 55 60 > * 

Gly Glu Leu Lys Lys Glu Lys Gly He His Arg Leu Glu Ala Met Leu 
65 ™ 75 go 

Tyr Ala He Asp Gin lie Asn Lys Asp Pro Asp Leu Leu Ser Asn He 
85 90 95 

Thr Leu Gly Val Arg He Leu Asp Thr Cys Ser Arg Asp Thr Tyr Ala 
100 105 no 

Leu Glu Gin Ser Leu Thr Phe Val Gin Ala Leu He Glu Lys Asp Ala 
115 120 125 

Ser Asp Val Lys Cys Ala Asn Gly Asp Pro Pro -He Phe Thr Lys Pro 
130 135 140 

Asp Lys He Ser Gly Val He Gly Ala Ala Ala Ser Ser Val Ser He 
145 ISO 155 160 

Met Val Ala Asn He Leu Arg Leu Phe Lys He Pro Gin He Ser Tyr 
165 170 175 
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Ala Ser Thr Ala Pro Glu Leu Ser Asp Asn Thr Arg Tyr Asp Phe Phe 
180 185 - 190 

Ser Arg Val Val Pro Pro Asp Ser Tyr Gln Ala Gln Ala Met Val Asp
195 200 205

Ile Val Thr Ala Leu Gly Trp Asn Tyr Val Ser Thr Leu Ala Ser Glu
210 215 220

Gly Asn Tyr Gly Glu Ser Gly Val Glu Ala Phe Thr Gln Ile Ser Arg
225 230 235 240

Glu Ile Gly Gly Val Cys Ile Ala Gln Ser Gln Lys Ile Pro Arg Glu
245 250 255

Pro Arg Pro Gly Glu Phe Glu Lys Ile Ile Lys Arg Leu Leu Glu Thr
260 265 270



Pro Asn Ala Arg Ala Val Ile Met Phe Ala Asn Glu Asp Asp Ile Arg
275 280 285

Arg Ile Leu Glu Ala Ala Lys Lys Leu Asn Gln Ser Gly His Phe Leu
290 295 300

Trp Ile Gly Ser Asp Ser Trp Gly Ser Lys Ile Ala Pro Val Tyr Gln
305 310 315 320

Gln Glu Glu Ile Ala Glu Gly Ala Val Thr Ile Leu Pro Lys Arg Ala
325 330 335

Ser Ile Asp Gly Phe Asp Arg Tyr Phe Arg Ser Arg Thr Leu Ala Asn
340 345 350 



Asn Arg Arg Asn Val Trp Phe Ala Glu Phe Trp Glu Glu Asn Phe Gly 

355 360 365 

Cys Lys Leu Gly Ser His Gly Lys Arg Asn Ser His Ile Lys Lys Cys
370 375 380

Thr Gly Leu Glu Arg Ile Ala Arg Asp Ser Ser Tyr Glu Gln Glu Gly
385 390 395 400

Lys Val Gln Phe Val Ile Asp Ala Val Tyr Ser Met Ala Tyr Ala Leu
405 410 415

His Asn Met His Lys Asp Leu Cys Pro Gly Tyr Ile Gly Leu Cys Pro
420 425 430

Arg Met Ser Thr Ile Asp Gly Lys Glu Leu Leu Gly Tyr Ile Arg Ala
435 440 445

Val Asn Phe Asn Gly Cys Arg Arg Gly Ile Gln Met Ser Leu Pro Trp
450 455 460 



Pro Thr Leu Phe Thr Pro Ser Phe Ser Ser Ser Trp Ala Val Leu Ala 
465 470 475 480 



Leu Leu Ser Leu Leu Met Lys Thr Glu Met Leu Leu Asp Val Met Ile
485 490 495

Ser Ser Ser Ile Lys
500 



<210> 110 

<211> 338 

<212> PRT 

<213> Homo sapien 

<400> 110 

Trp Lys Phe Thr Val Thr Phe Lys Ser Arg Lys Leu Ala Asn Ala His 
1 5 10 15 



Arg Ala Phe Ser Pro Trp Ala Leu Met Val Ala Ser Arg Arg Cys Ser 
20 25 30 



Leu Arg Trp Cys Pro Phe Cys Cys Val Ala Arg Ile Asn Phe Gly Ser
35 40 45

Trp Ile Ala Ile Pro Pro Val Glu Lys Met Val Cys Glu Gly Lys Arg
50 55 60 



Ser Ala Ser Cys Pro Cys Phe Phe Leu Leu Thr Ala Lys Phe Tyr Trp 
65 70 75 80 



Ile Leu Thr Met Met Gln Arg Thr His Ser Gln Glu Tyr Ala His Ser
85 90 95

Ile Arg Val Asp Gly Asp Ile Ile Leu Gly Gly Leu Phe Pro Val His
100 105 110 



Ala Lys Gly Glu Arg Gly Val Pro Cys Gly Glu Leu Lys Lys Glu Lys
115 120 125

Gly Ile His Arg Leu Glu Ala Met Leu Tyr Ala Ile Asp Gln Ile Asn
130 135 140

Lys Asp Pro Asp Leu Leu Ser Asn Ile Thr Leu Gly Val Arg Ile Leu
145 150 155 160

Asp Thr Cys Ser Arg Asp Thr Tyr Ala Leu Glu Gln Ser Leu Thr Phe
165 170 175

Val Gln Ala Leu Ile Glu Lys Asp Ala Ser Asp Val Lys Cys Ala Asn
180 185 190 



Gly Asp Pro Pro Ile Phe Thr Lys Pro Asp Lys Ile Ser Gly Val Ile
195 200 205

Gly Ala Ala Ala Ser Ser Val Ser Ile Met Val Ala Asn Ile Leu Arg
210 215 220

Leu Phe Lys Ile Pro Gln Ile Ser Tyr Ala Ser Thr Ala Pro Glu Leu
225 230 235 240 



Ser Asp Asn Thr Arg Tyr Asp Phe Phe Ser Arg Val Val Pro Pro Asp 
245 250 255 



Ser Tyr Gln Ala Gln Ala Met Val Asp Ile Val Thr Ala Leu Gly Trp
260 265 270 



Asn Tyr Val Ser Thr Leu Ala Ser Glu Gly Asn Tyr Gly Glu Ser Gly 
275 280 285 



Val Glu Ala Phe Thr Gln Ile Ser Arg Glu Ile Gly Tyr Pro Ser Leu
290 295 300

Phe Gly Ile Gln Gly Cys Leu His Glu Cys Phe Ala Ile Leu Cys Gln
305 310 315 320

Val Val Tyr Gln Phe Leu Leu Leu Met Gln Leu Ser Asp Ala Gln Thr
325 330 335 



Val Tyr

<210> 111

<211> 281

<212> PRT

<213> Homo sapien

<400> 111

Met Val Cys Glu Gly Lys Arg Ser Ala Ser Cys Pro Cys Phe Phe Leu
1 5 10 15

Leu Thr Ala Lys Phe Tyr Trp Ile Leu Thr Met Met Gln Arg Thr His
20 25 30

Ser Gln Glu Tyr Ala His Ser Ile Arg Val Asp Gly Asp Ile Ile Leu
35 40 45 



Gly Gly Leu Phe Pro Val His Ala Lys Gly Glu Arg Gly Val Pro Cys 
50 55 60 



Gly Glu Leu Lys Lys Glu Lys Gly Ile His Arg Leu Glu Ala Met Leu
65 70 75 80

Tyr Ala Ile Asp Gln Ile Asn Lys Asp Pro Asp Leu Leu Ser Asn Ile
85 90 95

Thr Leu Gly Val Arg Ile Leu Asp Thr Cys Ser Arg Asp Thr Tyr Ala
100 105 110 



Leu Glu Gln Ser Leu Thr Phe Val Gln Ala Leu Ile Glu Lys Asp Ala
115 120 125

Ser Asp Val Lys Cys Ala Asn Gly Asp Pro Pro Ile Phe Thr Lys Pro
130 135 140

Asp Lys Ile Ser Gly Val Ile Gly Ala Ala Ala Ser Ser Val Ser Ile
145 150 155 160

Met Val Ala Asn Ile Leu Arg Leu Phe Lys Ile Pro Gln Ile Ser Tyr
165 170 175 



Ala Ser Thr Ala Pro Glu Leu Ser Asp Asn Thr Arg Tyr Asp Phe Phe 
180 185 190 



Ser Arg Val Val Pro Pro Asp Ser Tyr Gln Ala Gln Ala Met Val Asp
195 200 205

Ile Val Thr Ala Leu Gly Trp Asn Tyr Val Ser Thr Leu Ala Ser Glu
210 215 220

Gly Asn Tyr Gly Glu Ser Gly Val Glu Ala Phe Thr Gln Ile Ser Arg
225 230 235 240

Glu Ile Gly Tyr Pro Ser Leu Phe Gly Ile Gln Gly Cys Leu His Glu
245 250 255



Cys Phe Ala Ile Leu Cys Gln Val Val Tyr Gln Phe Leu Leu Leu Met
260 265 270

Gln Leu Ser Asp Ala Gln Thr Val Tyr
275 280

<210> 112

<211>

<212> PRT

<213>

<400> 112



Met Val Val Phe Leu Ser Gly Asp Asn Leu Ser Ser Pro Ser His Ser 
1 5 10 15 



Gly Ile Ile Trp Gln Cys Leu Glu Thr Phe Trp Val Leu Thr Asn Arg
20 25 30

Arg Arg Met Pro Leu Ala Ser Ser Arg Ser Ala Met Leu Leu Ser Ile
35 40 45

Leu Gln Cys Pro Gly Gln Pro His Lys Asp Ser Ser Ser Leu Asn Val
50 55 60

Asn Ser Ala Glu Val Glu Glu Pro Tyr Phe Thr Gly Thr Arg Gln Glu
65 70 75 80

Glu Ser Val Cys Pro Arg Thr Leu Val Ile Ile Thr Cys Pro Leu Gln
85 90 95 



Tyr Ser Thr Thr Gln Arg Gln Cys Phe Leu Gly Asn Gly Phe Phe Ser
100 105 110

Leu Gln Val Leu His Phe Glu Asn Val Lys Asp Val Pro Phe Gly Phe
115 120 125

Gln Thr Val Thr Ser Asp Val Asn Lys Leu Ser Ser Phe Tyr Ser Leu
130 135 140

Lys Leu Ile Lys Arg Leu Tyr Val Asp Lys Ser Leu Asn Leu Ser Thr
145 150 155 160

Glu Phe Ile Ser Ser Thr Lys Arg Pro Tyr Ala Lys Glu Leu Glu Thr
165 170 175



Val Asp Phe Lys Asp Lys Leu Glu Glu Thr Lys Gly Gln Ile Asn Asn
180 185 190

Ser Ile Lys Asp Leu Thr Asp Gly His Phe Glu Asn Ile Leu Ala Asp
195 200 205

Asn Ser Val Asn Asp Gln Thr Lys Ile Leu Val Val Asn Ala Ala Tyr
210 215 220 



Phe Val Gly Lys Trp Met Lys Lys Phe Pro Glu Ser Glu Thr Lys Glu 
225 230 235 240 



Cys Pro Phe Arg Val Asn Lys Thr Asp Thr Lys Pro Val Gln Met Met
245 250 255

Asn Met Glu Ala Thr Phe Cys Met Gly Asn Ile Asp Ser Ile Asn Cys
260 265 270

Lys Ile Ile Glu Leu Pro Phe Gln Asn Lys His Leu Ser Met Phe Ile
275 280 285

Leu Leu Pro Lys Asp Val Glu Asp Glu Ser Thr Gly Leu Glu Lys Ile
290 295 300

Glu Lys Gln Leu Asn Ser Glu Ser Leu Ser Gln Trp Thr Asn Pro Ser
305 310 315 320

Thr Met Ala Asn Ala Lys Val Lys Leu Ser Ile Pro Lys Phe Lys Val
325 330 335



Glu Lys Met Ile Asp Pro Lys Ala Cys Leu Glu Asn Leu Gly Leu Lys
340 345 350

His Ile Phe Ser Glu Asp Thr Ser Asp Phe Ser Gly Met Ser Glu Thr
355 360 365

Lys Gly Val Ala Leu Ser Asn Val Ile His Lys Val Cys Leu Glu Ile
370 375 380

Thr Glu Asp Gly Gly Asp Ser Ile Glu Val Pro Gly Ala Arg Ile Leu
385 390 395 400

Gln His Lys Asp Glu Leu Asn Ala Asp His Pro Phe Ile Tyr Ile Ile
405 410 415



Arg His Asn Lys Thr Arg Asn Ile Ile Phe Phe Gly Lys Phe Cys Ser
420 425 430

Pro

Met Glu Arg Met Trp Thr Ser Glu Glu Phe Ser Phe Lys Lys Asn Ile
1 5 10 15

Met Lys Tyr Ile Leu Ile Thr Gln Gly Leu Leu Leu Val Glu Val Tyr
20 25 30 



Asn Leu Asp Thr Ser Thr Phe Cys Phe Cys Cys Ser Thr Val His Lys 
35 40 45 



Thr Phe Ser Asn Leu Gln Gly Lys Ile Val Glu Val Val Glu Val Lys
50 55 60

Leu Lys Phe Lys Ile Asp Ser Val Asn Thr Ile Gly Lys Glu Ala Glu
65 70 75 80

Leu Arg Ile Phe Trp Leu Phe Met Cys Ser Val Leu Leu Ser Ser His
85 90 95

Arg Ser Cys Val Val Thr Gln Asp Ser Gly Ser Thr Lys Pro Glu Leu
100 105 110

Thr Gly Trp Ile Asp Leu Gln Pro Ile Phe Phe His Tyr Thr Ala Ala
115 120 125

Phe Leu Phe Leu Lys Lys Gln Lys Ser Met Glu Gln Leu Leu Gln Lys
130 135 140 



Glu Ala Pro Leu Phe Asn Trp Leu Thr Glu Ser Met Asn His Cys Arg 
145 150 155 160 



Phe Ala Cys Pro Pro Val Cys Phe Ser His Arg Phe Tyr Leu Ser Glu
165 170 175

Trp Leu Leu His Leu Asn Thr Val Ala Met Val Ser Glu Gln Leu Lys
180 185 190

Thr Ser Phe Leu His Leu His Ser Lys Gly Phe Ile Phe Ser Arg Asn
195 200 205 



Lys Val Val Leu Gly Cys Phe Asn Ser Val Asp His Leu Glu Thr Arg 
210 215 220 



Glu Gln Gly Pro Trp Ile Thr Met Ser Thr Leu Gln Val Phe Gly Asn
225 230 235 240

Ala Gly Lys Gly Leu His Ser Ser Asn Leu Leu Thr Ser Phe Leu Ser
245 250 255



Val Arg Pro Met Leu Ala Cys Asn Tyr Ile Phe Leu Asn Leu Lys Thr
260 265 270

Lys Glu Ile Ser Asp Ile Phe Ser Val Ser Phe Phe Ser Phe His Ser
275 280 285

Ser Leu Lys Ser His Tyr Cys Ile Thr Phe Gln Ile Met Thr Trp Ile
290 295 300

Glu Gly Ser Leu Phe Cys Gln Gly Leu Cys Glu Val Asn Ile Arg Ile
305 310 315 320



Ser Thr Thr Thr Lys Leu Lys Asn Glu Ala Ala Leu Thr Glu Met Leu 
325 330 335 



Ala Lys Thr Asn Ser Leu Asp Cys Phe Cys His His Leu Leu Thr Trp 
340 345 350 



Asn Ser Phe His Ile Tyr Phe Ser Tyr Glu Glu Ile Gln Cys Gln Ser
355 360 365

Asp His Val Thr Ser Lys Gln Asn Phe Ile Glu Ala Tyr Pro Cys Leu
370 375 380

Asn Tyr Ser Lys Ile Lys Tyr Asn Lys Val Asn Tyr Phe Ala Leu Cys
385 390 395 400

Ser Ser Ile Val Leu Ser Tyr Ile Tyr Ile Leu Ile Cys Ile Leu Leu
405 410 415

Ser Lys Val Ser Glu Thr Thr Cys Trp Arg Ala Val Ala Phe Arg Gln
420 425 430

His Gln His His Leu Lys Ala Leu Leu Asn Thr His Tyr Trp Lys Arg
435 440 445 



Gly Glu Asn Thr Gly Ile Met Pro Val Ile Pro Val Leu Trp Glu Thr
450 455 460

Glu Ala Gly Gly Ser Arg Gly Ala Gln Glu Phe Lys Ile Arg Leu Gly
465 470 475 480

Asn Thr Ala Arg Pro Leu Ser Leu Gln Lys Gln Lys Gln Lys Thr Asn
485 490 495 



Arg His Gly Ser Ala Tyr Leu Trp Ser Glu Leu Leu Gly Ser Leu Arg 
500 505 510 



Gln Glu Asp His Leu Asn Pro Gly Val Arg Gly Cys Ser Glu Leu Leu
515 520 525

Leu Gly Gln Leu Gln Ser Ser Leu Gly Ser Lys Val Arg Pro Cys Leu
530 535 540

Leu Lys Lys Gln Asn Gln Lys Leu Asp Cys Trp Gly Pro Pro Arg Val
545 550 555 560

Ser Asp Ser Val Gly Leu Ala Val Gly Met Val Gly Leu Val Ile Cys
565 570 575 



Leu Ser Asn Lys Arg Pro Gly Asn Ala Pro Ala Ala Gly Pro Gly Pro 
580 585 590 



His Phe Ala Asp Tyr Tyr Ser Glu Lys Ser Thr Asn Ser Asp Val Gly 
595 600 605 



Phe Leu Leu Phe Leu Ile Ser Glu Phe Ile Trp Gly Arg Leu Phe Asn
610 615 620 



Leu Val Asp Pro His Leu Leu Ser Cys Glu Val Glu Val Asn Ala Thr 
625 630 635 640 



Cys Leu Glu Thr Val Arg Met Arg Asn Ser Glu Leu Lys Tyr Gln Ala
645 650 655

Gln Arg Ala Trp Met Leu Glu Glu Met Leu Thr Pro Met Val Arg Pro
660 665 670

Ser Leu Thr Leu Leu Ser Tyr Lys Val Arg Ser Glu Ile Gln Tyr Leu
675 680 685

Leu Arg Phe Leu Ser Gly Val Lys Cys Cys Asp Ser Lys Ser Leu Ile
690 695 700 



Leu
705

<400>

Leu Ile Ile Asn Asp Cys Tyr Ser His Ser Ile Gly Gly Val Val Leu
1 5 10 15



Val Asn Leu Ala Gly Ser Glu Lys Pro Ala Gly Arg Asp Thr Val Gly 
20 25 30



Ser Ile Trp Ser Leu Ala Gly Ala Met Leu Tyr Ala Val Tyr Ile Val
35 40 45

Met Ile Lys Arg Lys Val Asp Arg Glu Asp Lys Leu Asp Ile Pro Met
50 55 60 



Phe Phe Gly Phe Val Gly Leu Phe Asn Leu Leu Leu Leu Trp Pro Gly 
65 70 75 80 

Phe Phe Leu Leu His Tyr Thr Gly Phe Glu Asp Phe Glu Phe Pro Asn 
85 90 95 



Lys Val Val Leu Met Cys Ile Ile Ile Asn Gly Leu Ile Gly Thr Val
100 105 110



Leu Ser Glu Phe Leu Trp Leu Trp Gly Cys Phe Leu Thr Ser Ser Leu 
115 120 125 



Ile Gly Thr Leu Ala Leu Ser Leu Thr Ile Pro Leu Ser Ile Ile Ala
130 135 140

Asp Met Cys Met Gln Lys Val Gln Phe Ser Trp Leu Phe Phe Ala Gly
145 150 155 160

Ala Ile Pro Val Phe Phe Ser Phe Phe Ile Val Thr Leu Leu Cys His
165 170 175

Tyr Asn Asn Trp Asp Pro Val Met Val Gly Ile Arg Arg Ile Phe Ala
180 185 190

Phe Ile Cys Arg Lys His Arg Ile Gln Arg Val Pro Glu Asp Ser Glu
195 200 205

Gln Cys Glu Ser Leu Ile Ser Met His Ser Val Ser Gln Glu Asp Gly
210 215 220 



Ala Ser
225

Met Val Pro Pro Arg Arg His Arg Gly Ala Gly Arg Pro Gly Val Leu
1 5 10 15 



Ser Ser Ser Pro Pro Phe Arg Leu Arg Ser Ala Lys Phe Ser Gly Ile
20 25 30

Ala Leu Glu Asp Leu Arg Arg Ala Leu Lys Thr Arg Leu Gln Met Val
35 40 45

Cys Val Phe Val Met Asn Arg Met Asn Ser Gln Asn Ser Gly Phe Thr
50 55 60

Gln Arg Arg Arg Met Ala Leu Gly Ile Val Ile Leu Leu Leu Val Asp
65 70 75 80

Val Ile Trp Val Ala Ser Ser Glu Leu Thr Ser Tyr Val Phe Thr Gln
85 90 95 



Tyr Asn Lys Pro Phe Phe Ser Thr Phe Ala Lys Thr Ser Met Phe Val 
100 105 110



Leu Tyr Leu Leu Gly Phe Ile Ile Trp Lys Pro Trp Arg Gln Gln Cys
115 120 125

Thr Arg Gly Leu Arg Gly Lys His Ala Ala Phe Phe Ala Asp Ala Glu
130 135 140



Gly Tyr Phe Ala Ala Cys Thr Thr Asp Thr Thr Met Asn Ser Ser Leu 
145 150 155 160 



Ser Glu Pro Leu Tyr Val Pro Val Lys Phe His Asp Leu Pro Ser Glu 
165 170 175 



Lys Pro Glu Ser Thr Asn Ile Asp Thr Glu Lys Thr Pro Lys Lys Ser
180 185 190

Arg Val Arg Phe Ser Asn Ile Met Glu Ile Arg Gln Leu Pro Ser Ser
195 200 205 



His Ala Leu Glu Ala Lys Leu Ser Arg Met Ser Tyr Pro Val Lys Glu 
210 215 220 



Gln Glu Ser Ile Leu Lys Thr Val Gly Lys Leu Thr Ala Thr Gln Val
225 230 235 240

Ala Lys Ile Ser Phe Phe Phe Cys Phe Val Trp Phe Leu Ala Asn Leu
245 250 255

Ser Tyr Gln Glu Ala Leu Ser Asp Thr Gln Val Ala Ile Val Asn Ile
260 265 270

Leu Ser Ser Thr Ser Gly Leu Phe Thr Leu Ile Leu Ala Ala Val Phe
275 280 285



Pro Ser Asn Ser Gly Asp Arg Phe Thr Leu Ser Lys Leu Leu Ala Val 
290 295 300 



Ile Leu Ser Ile Gly Gly Val Val Leu Val Asn Leu Ala Gly Ser Glu
305 310 315 320

Lys Pro Ala Gly Arg Asp Thr Val Gly Ser Ile Trp Ser Leu Ala Gly
325 330 335

Ala Met Leu Tyr Ala Val Tyr Ile Val Met Ile Lys Arg Lys Val Asp
340 345 350

Arg Glu Asp Lys Leu Asp Ile Pro Met Phe Phe Gly Phe Val Gly Leu
355 360 365 



Phe Asn Leu Leu Leu Leu Trp Pro Gly Phe Phe Leu Leu His Tyr Thr 
370 375 380 



Gly Phe Glu Asp Phe Glu Phe Pro Asn Lys Val Val Leu Met Cys Ile
385 390 395 400

Ile Ile Asn Gly Leu Ile Gly Thr Val Leu Ser Glu Phe Leu Trp Leu
405 410 415

Trp Gly Cys Phe Leu Thr Ser Ser Leu Ile Gly Thr Leu Ala Leu Ser
420 425 430

Leu Thr Ile Pro Leu Ser Ile Ile Ala Asp Met Cys Met Gln Lys Val
435 440 445

Gln Phe Ser Trp Leu Phe Phe Ala Gly Ala Ile Pro Val Phe Phe Ser
450 455 460 



Phe Phe Ile Val Thr Leu Leu Cys His Tyr Asn Asn Trp Asp Pro Val
465 470 475 480

Met Val Gly Ile Arg Arg Ile Phe Ala Phe Ile Cys Arg Lys His Arg
485 490 495

Ile Gln Arg Trp Asn Leu Ala Leu Ser Pro Lys Leu Glu Cys Ser Gly
500 505 510

Glu Ile Ser Gly His Cys Asn Leu Tyr Leu Leu Gly Ser Ser Asp Ser
515 520 525 



Pro Ala Ser Ala Ser Arg Val Ala Gly Thr Thr Gly Ala Arg His His 
530 535 540 



Thr Gln Leu Ile Phe Val Phe Leu Val Glu Thr Gly Phe His His Val
545 550 555 560

Gly Gln Asp Gly Leu Asp Leu Leu Asn Ser
565 570

<400> 116



Met Asp Gly Arg Leu Asp Gly Trp Val Asp Gly Arg Gly Trp Pro Trp 
1 5 10 15

Met Arg Ser Ala Leu His Thr Gln Thr Arg Trp Glu Arg Phe Val Glu
20 25 30

His Asp Ser Leu Gln Gln Glu Tyr Met Cys Ala Tyr Leu Cys Gly Gln
35 40 45 



Lys Tyr Leu His Leu Gly Phe Gly Ala Ile Gln Glu Glu Met Ser Gln
50 55 60

Lys Gln Leu Asn Gln Gly Leu Ser Thr Leu Trp Ile Leu Asn Leu Lys
65 70 75 80

Met Gly Ala Gly Leu Cys Leu Lys Ala Leu Leu Ser His Leu Leu Gly
85 90 95

Pro Trp Phe Asn Lys Ala Leu Ser Lys Leu Arg Lys Thr Thr Thr Thr
100 105 110

Thr Gln Asn Lys Ser Ser Phe Phe Ser Ser Phe Leu Gly Lys Lys Ala
115 120 125

Leu Phe Phe Arg Glu Arg Val
130 135



<210> 117 

<211> 82 

<212> PRT 

<213> Homo sapien 

<400> 117 

Met Leu Leu Glu Arg Arg Ser Val Met Asp Val Val Ala Ala Glu Val 
1 5 10 15

Trp Gly Leu Thr Ala Gln Asn Leu Leu Leu Ser Cys Cys Gly Trp Met
20 25 30

Glu Gly Trp Met Asp Gly Trp Met Ala Val Asp Gly Arg Gly Cys Ala
35 40 45

Val Pro Cys Ile Pro Lys Pro Gly Gly Ser Val Leu Leu Ser Met Thr
50 55 60 



Ala Cys Ser Arg Asn Ile Cys Val Pro Ile Cys Val Asp Lys Asn Ile
65 70 75 80

Tyr Thr

Asn His Tyr Arg Arg Ile Gly Ala Leu Asp Ala Ala Arg Ala Ala Gln
1 5 10 15



Cys Asp Gly Cys Gly Arg Gly Arg Gly Leu Gly Pro Asp Cys Ser Glu 
20 25 30 



Leu Ala Ala Val Leu Leu Arg Met Asp Gly Arg Leu Asp Gly Trp Val 
35 40 45 



Asp Gly Arg Gly Trp Pro Trp Met Arg Ser Ala Leu His Thr Gln Thr
50 55 60

Arg Trp Glu Arg Phe Val Glu His Asp Ser Leu Gln Gln Glu Tyr Met
65 70 75 80

Cys Ala Tyr Leu Cys Gly Gln Lys Tyr Leu His Leu Gly Phe Gly Ala
85 90 95

Ile Gln Glu Glu Met Ser Gln Lys Gln Leu Asn Gln Gly Leu Ser Thr
100 105 110

Leu Trp Ile Leu Asn Leu Lys Met Gly Ala Gly Leu Cys Leu Lys Ala
115 120 125 



Leu Leu Ser His Leu Leu Gly Pro Trp Phe Asn Lys Ala Leu Ser Lys 
130 135 140 



Leu Arg
145

Met Asp Ser Pro Ser Pro Met Asp Pro His Met Ala Glu Gln Thr Phe
1 5 10 15 



Glu Gln Ala Ile Gln Ala Ala Ser Arg Ile Ile Arg Asn Glu Gln Phe
20 25 30

Ala Ile Arg Arg Phe Gln Ser Met Pro Val Arg Leu Leu Gly His Ser
35 40 45

Pro Val Leu Arg Asn Ile Thr Asn Ser Gln Ala Pro Asp Gly Arg Arg
50 55 60

Lys Ser Glu Ala Gly Ser Gly Ala Ala Ser Ser Ser Gly Glu Asp Lys
65 70 75 80 



Glu Asn Val Arg Phe Trp Lys Ala Gly Val Gly Ala Leu Arg Glu Glu 
85 90 95 



Glu Gly Ala Cys Trp Gly Gly Ser Leu Ala Cys Glu Asp Pro Pro Leu
100 105 110

Pro Ser Trp Leu Gln Asp Gly Phe Val Phe Lys Met Pro Trp Lys Pro
115 120 125 



Thr His Pro Ser Ser Thr His Ala Leu Ala Glu Trp Ala Ser Arg Arg 
130 135 140 



Glu Ala Phe Ala Gln Arg Pro Ser Ser Ala Pro Asp Leu Met Cys Leu
145 150 155 160 



Ser Pro Asp Arg Lys Met Glu Val Glu Glu Leu Ser Pro Leu Ala Leu 
165 170 175 



Gly Arg Phe Ser Leu Thr Pro Ala Glu Gly Asp Thr Glu Glu Asp Asp 
180 185 190 



Gly Phe Val Asp Ile Leu Glu Ser Asp Leu Lys Val Asn Ser Leu Val
195 200 205

Pro Pro Gly Pro Tyr Leu Pro Ile Pro Gly Phe Ala Gln Lys Lys Arg
210 215 220

Ala Leu Ser Pro Val Trp Gln Asp His Asp Val Ile Pro Val Ser Glu
225 230 235 240

Glu Glu Ser Glu Val Thr Glu Ser Gln Pro Asn Leu Trp Pro Met Arg
245 250 255

Glu Glu Gln Val Ala Gly Ala Thr Val Glu Gly Gly Leu Gly Gly Arg
260 265 270

Gly Ala Ser His Pro Asn Leu Glu Phe Ala Arg Glu Thr Gly Lys Ala
275 280 285 



Val Leu Glu Asp Ser Gly Thr Gly Gly Val Trp Pro Met Pro Val Gly 
290 295 300 



Cys Asp Pro Val Leu Ala Asn Leu Ala Ser Asp Asp Asp Ala Val Pro 
305 310 315 320 



Pro Gly Met Glu Ser Leu Ile Ser Ala Pro Leu Val Lys Thr Leu Glu
325 330 335

Lys Glu Glu Glu Lys Asp Leu Val Met Tyr Ser Lys Cys Gln Arg Leu
340 345 350

Phe Arg Ser Pro Ser Met Pro Cys Ser Val Ile Arg Pro Ile Leu Lys
355 360 365

Arg Leu Glu Arg Pro Gln Asp Arg Asp Thr Pro Val Gln Asn Lys Arg
370 375 380

Arg Arg Ser Val Thr Pro Pro Glu Glu Gln Gln Glu Ala Glu Glu Pro
385 390 395 400 



Lys Ala Arg Val Leu Arg Ser Lys Ser Leu Cys His Asp Glu Ile Glu
405 410 415

Asn Leu Leu Asp Ser Asp His Arg Glu Leu Ile Gly Asp Tyr Ser Lys
420 425 430

Ala Phe Leu Leu Gln Thr Val Asp Gly Lys His Gln Asp Leu Lys Tyr
435 440 445

Ile Ser Pro Glu Thr Met Val Ala Leu Leu Thr Gly Lys Phe Ser Asn
450 455 460

Ile Val Asp Lys Phe Val Ile Val Asp Cys Arg Tyr Pro Tyr Glu Tyr
465 470 475 480 



Glu Gly Gly His Ile Lys Thr Ala Val Asn Leu Pro Leu Glu Arg Asp
485 490 495

Ala Glu Ser Phe Leu Leu Lys Ser Pro Ile Ala Pro Cys Ser Leu Asp
500 505 510

Lys Arg Val Ile Leu Ile Phe His Cys Glu Phe Ser Ser Glu Arg Gly
515 520 525

Pro Arg Met Cys Arg Phe Ile Arg Glu Arg Asp Arg Ala Val Asn Asp
530 535 540

Tyr Pro Ser Leu Tyr Tyr Pro Glu Met Tyr Ile Leu Lys Gly Gly Tyr
545 550 555 560

Lys Glu Phe Phe Pro Gln His Pro Asn Phe Cys Glu Pro Gln Asp Tyr
565 570 575 



Arg Pro Met Asn His Glu Ala Phe Lys Asp Glu Leu Lys Thr Phe Arg 
580 585 590 



Leu Lys Thr Arg Ser Trp Ala Gly Glu Arg Ser Arg Arg Glu Leu Cys 
595 600 605 



Ser Arg Leu Gln Asp Gln
610 



<210> 120 

<211> 413 

<212> PRT 

<213> Homo sapien 

<400> 120 

Leu Lys Gly Lys Gln Pro Cys Pro Thr Arg Pro Leu Pro Ser Tyr Pro
1 5 10 15

Trp Val Arg Pro Lys Glu Glu Ser Ser Glu Pro Leu Trp Gln Asp His
20 25 30

Asp Val Ile Pro Val Ser Glu Glu Glu Ser Glu Val Thr Glu Ser Gln
35 40 45

Pro Asn Leu Trp Pro Met Arg Glu Glu Gln Val Ala Gly Ala Thr Val
50 55 60



Glu Gly Gly Leu Gly Gly Arg Gly Ala Ser His Pro Asn Leu Glu Phe 
65 70 75 80 



Ala Arg Glu Thr Gly Lys Ala Val Leu Glu Asp Ser Gly Thr Gly Gly 
85 90 95 



Val Trp Pro Met Pro Val Gly Cys Asp Pro Val Leu Ala Asn Leu Ala
100 105 110

Ser Asp Asp Asp Ala Val Pro Pro Gly Met Glu Ser Leu Ile Ser Ala
115 120 125

Pro Leu Val Lys Thr Leu Glu Lys Glu Glu Glu Lys Asp Leu Val Met
130 135 140

Tyr Ser Lys Cys Gln Arg Leu Phe Arg Ser Pro Ser Met Pro Cys Ser
145 150 155 160



Val Ile Arg Pro Ile Leu Lys Arg Leu Glu Arg Pro Gln Asp Arg Asp
165 170 175

Thr Pro Val Gln Asn Lys Arg Arg Arg Ser Val Thr Pro Pro Glu Glu
180 185 190

Gln Gln Glu Ala Glu Glu Pro Lys Ala Arg Val Leu Arg Ser Lys Ser
195 200 205

Leu Cys His Asp Glu Ile Glu Asn Leu Leu Asp Ser Asp His Arg Glu
210 215 220 



Leu Ile Gly Asp Tyr Ser Lys Ala Phe Leu Leu Gln Thr Val Asp Gly
225 230 235 240

Lys His Gln Asp Leu Lys Tyr Ile Ser Pro Glu Thr Met Val Ala Leu
245 250 255

Leu Thr Gly Lys Phe Ser Asn Ile Val Asp Lys Phe Val Ile Val Asp
260 265 270

Cys Arg Tyr Pro Tyr Glu Tyr Glu Gly Gly His Ile Lys Thr Ala Val
275 280 285 



Asn Leu Pro Leu Glu Arg Asp Ala Glu Ser Phe Leu Leu Lys Ser Pro 
290 295 300 

Ile Ala Pro Cys Ser Leu Asp Lys Arg Val Ile Leu Ile Phe His Cys
305 310 315 320

Glu Phe Ser Ser Glu Arg Gly Pro Arg Met Cys Arg Phe Ile Arg Glu
325 330 335



Arg Asp Arg Ala Val Asn Asp Tyr Pro Ser Leu Tyr Tyr Pro Glu Met 
340 345 350

Tyr Ile Leu Lys Gly Gly Tyr Lys Glu Phe Phe Pro Gln His Pro Asn
355 360 365

Phe Cys Glu Pro Gln Asp Tyr Arg Pro Met Asn His Glu Ala Phe Lys
370 375 380 



Asp Glu Leu Lys Thr Phe Arg Leu Lys Thr Arg Ser Trp Ala Gly Glu 
385 390 395 400 



Arg Ser Arg Arg Glu Leu Cys Ser Arg Leu Gln Asp Gln
405 410 



<210> 121 

<211> 331 

<212> PRT 

<213> Homo sapien 

<400> 121 

Met Glu Asp Gly Val Leu Lys Glu Gly Phe Leu Val Lys Arg Gly His 
1 5 10 15

Ile Val His Asn Trp Lys Ala Arg Trp Phe Ile Leu Arg Gln Asn Thr
20 25 30 



Leu Val Tyr Tyr Lys Leu Glu Gly Gly Arg Arg Val Thr Pro Pro Lys 
35 40 45 



Gly Arg Ile Leu Leu Asp Gly Cys Thr Ile Thr Cys Pro Cys Leu Glu
50 55 60

Tyr Glu Asn Arg Pro Leu Leu Ile Lys Leu Lys Thr Gln Thr Ser Thr
65 70 75 80 



Glu Tyr Phe Leu Glu Ala Cys Ser Arg Glu Glu Arg Asp Ala Trp Ala 
85 90 95 



Phe Glu Ile Thr Gly Ala Ile His Ala Gly Gln Pro Gly Lys Val Gln
100 105 110

Gln Leu His Ser Leu Arg Asn Ser Phe Lys Leu Pro Pro His Ile Ser
115 120 125

Leu His Arg Ile Val Asp Lys Met His Asp Ser Asn Thr Gly Ile Arg
130 135 140 



Ser Ser Pro Asn Met Glu Gln Gly Ser Thr Tyr Lys Lys Thr Phe Leu
145 150 155 160

Gly Ser Ser Leu Val Asp Trp Leu Ile Ser Asn Ser Phe Thr Ala Ser
165 170 175 



Arg Leu Glu Ala Val Thr Leu Ala Ser Met Leu Met Glu Glu Asn Phe 
180 185 190 



Leu Arg Pro Val Gly Val Arg Ser Met Gly Ala Ile Arg Ser Gly Asp
195 200 205

Leu Ala Glu Gln Phe Leu Asp Asp Ser Thr Ala Leu Tyr Thr Phe Ala
210 215 220

Glu Ser Tyr Lys Lys Lys Ile Ser Pro Lys Glu Glu Ile Ser Leu Ser
225 230 235 240

Thr Val Glu Leu Ser Gly Thr Val Val Lys Gln Gly Tyr Leu Ala Lys
245 250 255

Gln Gly His Lys Arg Lys Asn Trp Lys Val Arg Arg Phe Val Leu Arg
260 265 270 



Lys Asp Pro Ala Phe Leu His Tyr Tyr Asp Pro Ser Lys Glu Glu Asn 
275 280 285 



Arg Pro Val Gly Gly Phe Ser Leu Arg Gly Ser Leu Val Ser Ala Leu 
290 295 300 



Glu Asp Asn Gly Val Pro Thr Gly Lys Val Gln Leu Gln Pro Trp Pro
305 310 315 320

Glu Asp Leu Arg Gly Leu Pro Pro Ala Pro Gln
325 330 



<210> 122 

<211> 176 

<212> PRT 

<213> Homo sapien 

<400> 122 

Met Ser Ile Met Ser Tyr Asn Gly Gly Ala Val Met Ala Met Lys Gly
1 5 10 15

Lys Asn Cys Val Ala Ile Ala Ala Asp Arg Arg Phe Gly Ile Gln Ala
20 25 30

Gln Met Val Thr Thr Asp Phe Gln Lys Ile Phe Pro Met Gly Asp Arg
35 40 45

Leu Tyr Ile Gly Leu Ala Gly Leu Ala Thr Asp Val Gln Thr Val Ala
50 55 60 



Gln Arg Leu Lys Phe Arg Leu Asn Leu Tyr Glu Leu Lys Glu Gly Arg
65 70 75 80

Gln Ile Lys Pro Tyr Thr Leu Met Ser Met Val Ala Asn Leu Leu Tyr
85 90 95

Glu Lys Arg Phe Gly Pro Tyr Tyr Thr Glu Pro Val Ile Ala Gly Leu
100 105 110

Asp Pro Lys Thr Phe Lys Pro Phe Ile Cys Ser Leu Asp Leu Ile Gly
115 120 125 



Cys Pro Met Val Thr Asp Asp Phe Val Val Ser Gly Thr Cys Ala Glu 
130 135 140 



Gln Met Tyr Gly Met Cys Glu Ser Leu Trp Glu Pro Asn Met Val Arg
145 150 155 160 



Trp Trp His Gly Glu Gly Leu Gly Ser Glu Leu Pro Thr Leu Gly His 
165 170 175 



<210> 123 

<211> 158 

<212> PRT 

<213> Homo sapien 

<400> 123 

Met Ser Ile Met Ser Tyr Asn Gly Gly Ala Val Met Ala Met Lys Gly
1 5 10 15

Lys Asn Cys Val Ala Ile Ala Ala Asp Arg Arg Phe Gly Ile Gln Ala
20 25 30

Gln Met Val Thr Thr Asp Phe Gln Lys Ile Phe Pro Met Gly Asp Arg
35 40 45

Leu Tyr Ile Gly Leu Ala Gly Leu Ala Thr Asp Val Gln Thr Val Ala
50 55 60

Gln Arg Leu Lys Phe Arg Leu Asn Leu Tyr Glu Leu Lys Glu Gly Arg
65 70 75 80 



Gln Ile Lys Pro Tyr Thr Leu Met Ser Met Val Ala Asn Leu Leu Tyr
85 90 95

Glu Lys Arg Phe Gly Pro Tyr Tyr Thr Glu Pro Val Ile Ala Gly Leu
100 105 110

Asp Pro Lys Thr Phe Lys Pro Phe Ile Cys Ser Leu Asp Leu Ile Gly
115 120 125 



Cys Pro Met Val Thr Asp Asp Phe Val Val Ser Gly Thr Cys Ala Glu 
130 135 140 



Gln Met Tyr Leu Gly Arg Asp His Ala Lys Pro Asn Ser Ser
145 150 155

<210> 124

<211> 246

<212> PRT

<213> Homo sapien

<400> 124



Met Leu Phe Pro Gln Val Pro Pro Pro Arg Pro Gly Leu Ala Asp Pro
1 5 10 15



Pro Leu Pro Phe Pro Ser Ser Asn Leu Asn Lys Pro Tyr Leu Phe Ser 
20 25 30 



Val Leu Leu Gly Ala Trp Gln Leu Gly Asn Pro Gly Ser Arg Ser Gln
35 40 45 



Lys Val Gly Val Ala Trp Val Glu Pro His Pro Val Tyr Ser Trp Lys 
50 55 60 



Glu Gly Ala Cys Ala Asp Ile Ala Leu Val Arg Leu Glu Arg Ser Ile
65 70 75 80

Gln Phe Ser Glu Arg Val Leu Pro Ile Cys Leu Pro Asp Ala Ser Ile
85 90 95

His Leu Pro Pro Asn Thr His Cys Trp Ile Ser Gly Trp Gly Ser Ile
100 105 110

Gln Asp Gly Val Pro Leu Pro His Pro Gln Thr Leu Gln Lys Leu Lys
115 120 125

Val Pro Ile Ile Asp Ser Glu Val Cys Ser His Leu Tyr Trp Arg Gly
130 135 140

Ala Gly Gln Gly Pro Ile Thr Glu Asp Met Leu Cys Ala Gly Tyr Leu
145 150 155 160



Glu Gly Glu Arg Asp Ala Cys Leu Gly Asp Ser Gly Gly Pro Leu Met 
165 170 175 



Cys Gln Val Asp Gly Ala Trp Leu Leu Ala Gly Ile Ile Ser Trp Gly
180 185 190

Glu Gly Cys Ala Glu Arg Asn Arg Pro Gly Val Tyr Ile Ser Leu Ser
195 200 205

Ala His Arg Ser Trp Val Glu Lys Ile Val Gln Gly Val Gln Leu Arg
210 215 220

Gly Arg Ala Gln Gly Gly Gly Ala Leu Arg Ala Pro Ser Gln Gly Ser
225 230 235 240 



Gly Ala Ala Ala Arg Ser 
245 



<210> 125 

<211> 273 

<212> PRT 

<213> Homo sapien 

<400> 125 

Met Arg Leu Trp Leu Gly Arg Asp Ala Glu Gly Thr Glu Leu Ser Arg 
1 5 10 15 



Arg His Asn Trp Thr Lys Pro Glu Pro Gln Ala Pro Val Ala Trp Glu
20 25 30

Arg Val Ala Pro Ser Asn Leu Pro Gln Gly His Pro Leu Pro Lys Ser
35 40 45 



Phe Ser Ser Pro Pro Ser Pro Ser Asn Lys Arg Glu Glu Glu Glu Glu 
50 55 60 



Glu Phe Asn Phe Glu Val Ile Pro Pro Pro Pro Glu Phe Ser Asn Asp
65 70 75 80

Pro Glu Pro Pro Ala Pro Ala Leu Gln Tyr Leu Gly Arg Gln Ser Ser
85 90 95

Pro Pro Arg Asn Asn Tyr Ser Asp Leu Arg Gln Leu Pro Asn Ala Gly
100 105 110



Pro Gly Ala Pro Pro Ala Leu Gly Phe Ser Arg Phe Pro Ala Gly Ala 
115 120 125 



Arg Tyr Ala Gly Ala Gly Gly Leu Glu Arg Phe Ser Gly Gly Gly Arg 
130 135 140 



Ser Leu Ile Lys Lys Arg Leu Tyr Val Gly Glu Pro His Arg Gly Pro
145 150 155 160



Gly Leu Pro His Gly Gly Thr Gly Arg Ser Leu Ser Ser Pro Asn Cys 
165 170 175 



Phe Gly Pro Gln Pro Gly Gly Pro Glu Met Arg Arg Val Asn Ser Ala
180 185 190 



Gly Arg Ala Pro Pro Gly Gly Leu His Ala Pro Arg Leu Ser Leu Glu 
195 200 205 



Gly Ala Ala Arg Gly Ala Ala Glu Ala Lys His Lys Ala Pro Gly Ser 
210 215 220 



Ala Asp Tyr Gly Phe Ala Pro Ala Ala Gly Arg Ser Pro Tyr Thr Thr 
225 230 235 240 



Thr Arg Tyr Gly Ser Pro Ile Asn Thr Phe Thr Val Arg Pro Gly Thr
245 250 255

Arg His Pro Ile Ser Tyr Val Cys Ser Gly Ala His Arg Lys Ala Thr
260 265 270 



Ser 



<210> 126 

<211> 278 

<212> PRT 

<213> Homo sapien 

<400> 126 

Ser Ser Cys His Asn Leu Arg Leu Trp Leu Gly Arg Asp Ala Glu Gly 
1 5 10 15

Thr Glu Leu Ser Arg Arg His Asn Trp Thr Lys Pro Glu Pro Gln Ala
20 25 30

Pro Val Ala Trp Glu Arg Val Ala Pro Ser Asn Leu Pro Gln Gly His
35 40 45 



Pro Leu Pro Lys Ser Phe Ser Ser Pro Pro Ser Pro Ser Asn Lys Arg 
50 55 60 



Glu Glu Glu Glu Glu Glu Phe Asn Phe Glu Val Ile Pro Pro Pro Pro
65 70 75 80

Glu Phe Ser Asn Asp Pro Glu Pro Pro Ala Pro Ala Leu Gln Tyr Leu
85 90 95

Gly Arg Gln Ser Ser Pro Pro Arg Asn Asn Tyr Ser Asp Leu Arg Gln
100 105 110 



Leu Pro Asn Ala Gly Pro Gly Ala Pro Pro Ala Leu Gly Phe Ser Arg 
115 120 125 



Phe Pro Ala Gly Ala Arg Tyr Ala Gly Ala Gly Gly Leu Glu Arg Phe 
130 135 140 



Ser Gly Gly Gly Arg Ser Leu Ile Lys Lys Arg Leu Tyr Val Gly Glu
145 150 155 160 



Pro His Arg Gly Pro Gly Leu Pro His Gly Gly Thr Gly Arg Ser Leu 
165 170 175 



Ser Ser Pro Asn Cys Phe Gly Pro Gln Pro Gly Gly Pro Glu Met Arg
180 185 190 



Arg Val Asn Ser Ala Gly Arg Ala Pro Pro Gly Gly Leu His Ala Pro 
195 200 205 



Arg Leu Ser Leu Glu Gly Ala Ala Arg Gly Ala Ala Glu Ala Lys His 
210 215 220 



Lys Ala Pro Gly Ser Ala Asp Tyr Gly Phe Ala Pro Ala Ala Gly Arg 
225 230 235 240 



Ser Pro Tyr Thr Thr Thr Arg Tyr Gly Ser Pro Ile Asn Thr Phe Thr
245 250 255 



Val Arg Pro Gly Thr Arg His Pro Ile Ser Tyr Val Cys Ser Gly Ala
260 265 270

His Arg Lys Ala Thr Ser
275

<210> 127

<211> 2306

<212> PRT

<213> Homo sapien

<400> 127



Met Ala Ala Leu Val Leu Glu Asp Gly Ser Val Leu Arg Gly Gln Pro
1 5 10 15

Phe Gly Ala Ala Val Ser Thr Ala Gly Glu Val Val Phe Gln Thr Gly
20 25 30

Met Val Gly Tyr Pro Glu Ala Leu Thr Asp Pro Ser Tyr Lys Ala Gln
35 40 45

Ile Leu Val Leu Thr Tyr Pro Leu Ile Gly Asn Tyr Gly Ile Pro Pro
50 55 60 



Asp Glu Met Asp Glu Phe Gly Leu Cys Lys Trp Phe Glu Ser Ser Gly 
65 70 75 80 



Ile His Val Ala Ala Leu Val Val Gly Glu Cys Cys Pro Thr Pro Ser
85 90 95

His Trp Ser Ala Thr Arg Thr Leu His Glu Trp Leu Gln Gln His Gly
100 105 110

Ile Pro Gly Leu Gln Gly Val Asp Thr Arg Glu Leu Thr Lys Lys Leu
115 120 125

Arg Glu Gln Gly Ser Leu Leu Gly Lys Leu Val Gln Asn Gly Thr Glu
130 135 140 



Pro Ser Ser Leu Pro Phe Leu Asp Pro Asn Ala Arg Pro Leu Val Pro 
145 150 155 160 



Glu Val Ser He Lys Thr Pro Arg Val Phe Asn Thr Gly Gly Ala Pro 
165 170 175 



Arg Ile Leu Ala Leu Asp Cys Gly Leu Lys Tyr Asn Gln Ile Arg Cys
180 185 190



Leu Cys Gln Arg Gly Ala Glu Val Thr Val Val Pro Trp Asp His Ala
195 200 205 



Leu Asp Ser Gln Glu Tyr Glu Gly Leu Phe Leu Ser Asn Gly Pro Gly
210 215 220 



Asp Pro Ala Ser Tyr Pro Ser Val Val Ser Thr Leu Ser Arg Val Leu 
225 230 235 240 



Ser Glu Pro Asn Pro Arg Pro Val Phe Gly Ile Cys Leu Gly His Gln
245 250 255 



Leu Leu Ala Leu Ala Ile Gly Ala Lys Thr Tyr Lys Met Arg Tyr Gly
260 265 270 



Asn Arg Gly His Asn Gln Pro Cys Leu Leu Val Gly Ser Gly Arg Cys
275 280 285 



Phe Leu Thr Ser Gln Asn His Gly Phe Ala Val Glu Thr Asp Ser Leu
290 295 300 



Pro Ala Asp Trp Ala Pro Leu Phe Thr Asn Ala Asn Asp Gly Ser Asn 
305 310 315 320 



Glu Gly Ile Val His Asn Ser Leu Pro Phe Phe Ser Val Gln Phe His
325 330 335 



Pro Glu His Gln Ala Gly Pro Ser Asp Met Glu Leu Leu Phe Asp Ile
340 345 350 



Phe Leu Glu Thr Val Lys Glu Ala Thr Ala Gly Asn Pro Gly Gly Gln
355 360 365 



Thr Val Arg Glu Arg Leu Thr Glu Arg Leu Cys Pro Pro Gly Ile Pro
370 375 380 



Thr Pro Gly Ser Gly Leu Pro Pro Pro Arg Lys Val Leu Ile Leu Gly
385 390 395 400 



Ser Gly Gly Leu Ser Ile Gly Gln Ala Gly Glu Phe Asp Tyr Ser Gly
405 410 415 



Ser Gln Ala Ile Lys Ala Leu Lys Glu Glu Asn Ile Gln Thr Leu Leu
420 425 430 
Ile Asn Pro Asn Ile Ala Thr Val Gln Thr Ser Gln Gly Leu Ala Asp
435 440 445 



Lys Val Tyr Phe Leu Pro Ile Thr Pro His Tyr Val Thr Gln Val Ile
450 455 460 



Arg Asn Glu Arg Pro Asp Gly Val Leu Leu Thr Phe Gly Gly Gln Thr
465 470 475 480 



Ala Leu Asn Cys Gly Val Glu Leu Thr Lys Ala Gly Val Leu Ala Arg 
485 490 495 



Tyr Gly Val Arg Val Leu Gly Thr Thr Val Glu Thr Ile Glu Leu Thr
500 505 510 



Glu Asp Arg Arg Ala Phe Ala Ala Arg Met Ala Glu Ile Gly Glu His
515 520 525



Val Ala Pro Ser Glu Ala Gly Asn Ser Leu Glu Gln Ala Gln Ala Ala
530 535 540 



Ala Glu Arg Leu Gly Tyr Pro Val Leu Val Arg Ala Ala Phe Ala Val 
545 550 555 560 



Gly Gly Leu Gly Ser Gly Phe Ala Ser Asn Arg Glu Glu Leu Ser Ala 
565 570 575 



Leu Val Ala Pro Ala Phe Ala His Thr Ser Gln Val Leu Val Asp Lys
580 585 590 



Ser Leu Lys Gly Trp Lys Glu Ile Glu Tyr Glu Val Val Arg Asp Ala
595 600 605 



Tyr Gly Asn Cys Val Thr Val Cys Asn Met Glu Asn Leu Asp Pro Leu 
610 615 620 



Gly Ile His Thr Gly Glu Ser Ile Val Val Ala Pro Ser Gln Thr Leu
625 630 635 640 



Asn Asp Arg Glu Tyr Gln Leu Leu Arg Gln Thr Ala Ile Lys Val Thr
645 650 655 



Gln His Leu Gly Ile Val Gly Glu Cys Asn Val Gln Tyr Ala Leu Asn
660 665 670 
Pro Glu Ser Glu Gln Tyr Tyr Ile Ile Glu Val Asn Ala Arg Leu Ser
675 680 685 



Arg Ser Ser Ala Leu Ala Ser Lys Ala Thr Gly Tyr Pro Leu Ala Tyr 
690 695 700 



Val Ala Ala Lys Leu Ala Leu Gly Ile Pro Leu Pro Glu Leu Arg Asn
705 710 715 720 



Ser Val Thr Gly Gly Thr Ala Ala Phe Glu Pro Ser Val Asp Tyr Cys 
725 730 735 



Val Val Lys Ile Pro Arg Trp Asp Leu Ser Lys Phe Leu Arg Val Ser
740 745 750 



Thr Lys Ile Gly Ser Cys Met Lys Ser Val Gly Glu Val Met Gly Ile
755 760 765 



Gly Arg Ser Phe Glu Glu Ala Phe Gln Lys Ala Leu Arg Met Val Asp
770 775 780 



Glu Asn Cys Val Gly Phe Asp His Thr Val Lys Pro Val Ser Asp Met 
785 790 795 800 



Glu Leu Glu Thr Pro Thr Asp Lys Arg Ile Phe Val Val Ala Ala Ala
805 810 815 



Leu Trp Ala Gly Tyr Ser Val Asp Arg Leu Tyr Glu Leu Thr Arg Ile
820 825 830 



Asp Arg Trp Phe Leu His Arg Met Lys Arg Ile Ile Ala His Ala Gln
835 840 845 



Leu Leu Glu Gln His Arg Gly Gln Pro Leu Pro Pro Asp Leu Leu Gln
850 855 860 



Gln Ala Lys Cys Leu Gly Phe Ser Asp Lys Gln Ile Ala Leu Ala Val
865 870 875 880



Leu Ser Thr Glu Leu Ala Val Arg Lys Leu Arg Gln Glu Leu Gly Ile
885 890 895 



Cys Pro Ala Val Lys Gln Ile Asp Thr Val Ala Ala Glu Trp Pro Ala
900 905 910 
Gln Thr Asn Tyr Leu Tyr Leu Thr Tyr Trp Gly Thr Thr His Asp Leu
915 920 925 



Thr Phe Arg Thr Pro His Val Leu Val Leu Gly Ser Gly Val Tyr Arg 
930 935 940 



Ile Gly Ser Ser Val Glu Phe Asp Trp Cys Ala Val Gly Cys Ile Gln
945 950 955 960 



Gln Leu Arg Lys Met Gly Tyr Lys Thr Ile Met Val Asn Tyr Asn Pro
965 970 975 



Glu Thr Val Ser Thr Asp Tyr Asp Met Cys Asp Arg Leu Tyr Phe Asp 
980 985 990 



Glu Ile Ser Phe Glu Val Val Met Asp Ile Tyr Glu Leu Glu Asn Pro
995 1000 1005 



Glu Gly Val Ile Leu Ser Met Gly Gly Gln Leu Pro Asn Asn Met
1010 1015 1020 



Ala Met Ala Leu His Arg Gln Gln Cys Arg Val Leu Gly Thr Ser
1025 1030 1035 



Pro Glu Ala Ile Asp Ser Ala Glu Asn Arg Phe Lys Phe Ser Arg
1040 1045 1050 



Leu Leu Asp Thr Ile Gly Ile Ser Gln Pro Gln Trp Arg Glu Leu
1055 1060 1065 



Ser Asp Leu Glu Ser Ala Arg Gln Phe Cys Gln Thr Val Gly Tyr
1070 1075 1080 



Pro Cys Val Val Arg Pro Ser Tyr Val Leu Ser Gly Ala Ala Met 
1085 1090 1095 



Asn Val Ala Tyr Thr Asp Gly Asp Leu Glu Arg Phe Leu Ser Ser 
1100 1105 1110



Ala Ala Ala Val Ser Lys Glu His Pro Val Val Ile Ser Lys Phe
1115 1120 1125 



Ile Gln Glu Ala Lys Glu Ile Asp Val Asp Ala Val Ala Ser Asp
1130 1135 1140 



Gly Val Val Ala Ala Ile Ala Ile Ser Glu His Val Glu Asn Ala
1145 1150 1155



Gly Val His Ser Gly Asp Ala Thr Leu Val Thr Pro Pro Gln Asp
1160 1165 1170 



Ile Thr Ala Lys Thr Leu Glu Arg Ile Lys Ala Ile Val His Ala
1175 1180 1185 



Val Gly Gln Glu Leu Gln Val Thr Gly Pro Phe Asn Leu Gln Leu
1190 1195 1200 



Ile Ala Lys Asp Asp Gln Leu Lys Val Ile Glu Cys Asn Val Arg
1205 1210 1215 



Val Ser Arg Ser Phe Pro Phe Val Ser Lys Thr Leu Gly Val Asp 
1220 1225 1230 



Leu Val Ala Leu Ala Thr Arg Val Ile Met Gly Glu Glu Val Glu
1235 1240 1245 



Pro Val Gly Leu Met Thr Gly Ser Gly Val Val Gly Val Lys Val 
1250 1255 1260 



Pro Gln Phe Ser Phe Ser Arg Leu Ala Gly Ala Asp Val Val Leu
1265 1270 1275 



Gly Val Glu Met Thr Ser Thr Gly Glu Val Ala Gly Phe Gly Glu
1280 1285 1290 



Ser Arg Cys Glu Ala Tyr Leu Lys Ala Met Leu Ser Thr Gly Phe 
1295 1300 1305 



Lys Ile Pro Lys Lys Asn Ile Leu Leu Thr Ile Gly Ser Tyr Lys
1310 1315 1320 



Asn Lys Ser Glu Leu Leu Pro Thr Val Arg Leu Leu Glu Ser Leu 
1325 1330 1335 



Gly Tyr Ser Leu Tyr Ala Ser Leu Gly Thr Ala Asp Phe Tyr Thr 
1340 1345 1350 



Glu His Gly Val Lys Val Thr Ala Val Asp Trp His Phe Glu Glu 
1355 1360 1365 



Ala Val Asp Gly Glu Cys Pro Pro Gln Arg Ser Ile Leu Glu Gln
1370 1375 1380 
Leu Ala Glu Lys Asn Phe Glu Leu Val Ile Asn Leu Ser Met Arg
1385 1390 1395 



Gly Ala Gly Gly Arg Arg Leu Ser Ser Phe Val Thr Lys Gly Tyr 
1400 1405 1410 



Arg Thr Arg Arg Leu Ala Ala Asp Phe Ser Val Pro Leu Ile Ile
1415 1420 1425



Asp Ile Lys Cys Thr Lys Leu Phe Val Glu Ala Leu Gly Gln Ile
1430 1435 1440 



Gly Pro Ala Pro Pro Leu Lys Val His Val Asp Cys Met Thr Ser 
1445 1450 1455 



Gln Lys Leu Val Arg Leu Pro Gly Leu Ile Asp Val His Val His
1460 1465 1470 



Leu Arg Glu Pro Gly Gly Thr His Lys Glu Asp Phe Ala Ser Gly 
1475 1480 1485 



Thr Ala Ala Ala Leu Ala Gly Gly Ile Thr Met Val Cys Ala Met
1490 1495 1500 



Pro Asn Thr Arg Pro Pro Ile Ile Asp Ala Pro Ala Leu Ala Leu
1505 1510 1515 



Ala Gln Lys Leu Ala Glu Ala Gly Ala Arg Cys Asp Phe Ala Leu
1520 1525 1530 



Phe Leu Gly Ala Ser Ser Glu Asn Ala Gly Thr Leu Gly Thr Val 
1535 1540 1545 



Ala Gly Ser Ala Ala Gly Leu Lys Leu Tyr Leu Asn Glu Thr Phe 
1550 1555 1560 



Ser Glu Leu Arg Leu Asp Ser Val Val Gln Trp Met Glu His Phe
1565 1570 1575 



Glu Thr Trp Pro Ser His Leu Pro Ile Val Ala His Ala Glu Gln
1580 1585 1590 



Gln Thr Val Ala Ala Val Leu Met Val Ala Gln Leu Thr Gln Arg
1595 1600 1605 
Ser Val His Ile Cys His Val Ala Arg Lys Glu Glu Val Arg Val
1610 1615 1620 



His Leu Arg Ser Cys Cys Pro Cys Cys Phe Pro Ser Asn Thr Lys 
1625 1630 1635 



Gly Gln Val Val Leu Arg Gly Arg Arg Gln Gln Glu Glu Ser Leu
1640 1645 1650 



Glu Thr Ala Gly Gly Gly Ser Arg Ala Ser Thr Leu Ala Gly Leu 
1655 1660 1665 



His His His Phe Phe Glu Gln Gly Cys Trp Pro Leu Gly Leu Cys
1670 1675 1680 



Val Gly Gln Gly Ile Trp Val Val Pro Leu Leu Asp Leu Pro Ile
1685 1690 1695 



Val Pro Gln Ile Leu Leu Ile Lys Ala Ala Lys Ala Arg Gly Leu
1700 1705 1710 



Pro Val Thr Cys Glu Val Ala Pro His His Leu Phe Leu Ser His 
1715 1720 1725 



Asp Asp Leu Glu Arg Leu Gly Pro Gly Lys Gly Glu Val Arg Pro 
1730 1735 1740 



Glu Leu Gly Ser Arg Gln Asp Val Glu Ala Leu Trp Glu Asn Met
1745 1750 1755 



Ala Val Ile Asp Cys Phe Ala Ser Asp His Ala Pro His Thr Leu
1760 1765 1770 



Glu Glu Lys Cys Gly Ser Arg Pro Pro Pro Gly Phe Pro Gly Leu 
1775 1780 1785 



Glu Thr Met Leu Pro Leu Leu Leu Thr Ala Val Ser Glu Gly Arg 
1790 1795 1800 



Leu Ser Leu Asp Asp Leu Leu Gln Arg Leu His His Asn Pro Arg
1805 1810 1815 



Arg Ile Phe His Leu Pro Pro Gln Glu Asp Thr Tyr Val Glu Val
1820 1825 1830 
Asp Leu Glu His Glu Trp Thr Ile Pro Ser His Met Pro Phe Ser
1835 1840 1845 



Lys Ala His Trp Thr Pro Phe Glu Gly Gln Lys Val Lys Gly Thr
1850 1855 1860



Val Arg Arg Val Val Leu Arg Gly Glu Val Ala Tyr Ile Asp Gly
1865 1870 1875 



Gln Val Leu Val Pro Pro Gly Tyr Gly Gln Asp Val Arg Lys Trp
1880 1885 1890 



Pro Gln Gly Ala Val Pro Gln Leu Pro Pro Ser Ala Pro Ala Thr
1895 1900 1905 



Ser Glu Met Thr Thr Thr Pro Glu Arg Pro Arg Arg Gly Ile Pro
1910 1915 1920 



Gly Leu Pro Asp Gly Arg Phe His Leu Pro Pro Arg Ile His Arg
1925 1930 1935 



Ala Ser Asp Pro Gly Leu Pro Ala Glu Glu Pro Lys Glu Lys Ser 
1940 1945 1950 



Ser Arg Lys Val Ala Glu Pro Glu Leu Met Gly Thr Pro Asp Gly 
1955 1960 1965



Thr Cys Tyr Pro Pro Pro Pro Val Pro Arg Gln Ala Ser Pro Gln
1970 1975 1980 



Asn Leu Gly Thr Pro Gly Leu Leu His Pro Gln Thr Ser Pro Leu
1985 1990 1995 



Leu His Ser Leu Val Gly Gln His Ile Leu Ser Val Gln Gln Phe
2000 2005 2010 



Thr Lys Asp Gln Met Ser His Leu Phe Asn Val Ala His Thr Leu
2015 2020 2025 



Arg Met Met Val Gln Lys Glu Arg Ser Leu Asp Ile Leu Lys Gly
2030 2035 2040 



Lys Val Met Ala Ser Met Phe Tyr Glu Val Ser Thr Arg Thr Ser 
2045 2050 2055 



Ser Ser Phe Ala Ala Ala Met Ala Arg Leu Gly Gly Ala Val Leu 
2060 2065 2070



Ser Phe Ser Glu Ala Thr Ser Ser Val Gln Lys Gly Glu Ser Leu
2075 2080 2085 



Ala Asp Ser Val Gln Thr Met Ser Cys Tyr Ala Asp Val Val Val
2090 2095 2100 



Leu Arg His Pro Gln Pro Gly Ala Val Glu Leu Ala Ala Lys His
2105 2110 2115 



Cys Arg Arg Pro Val Ile Asn Ala Gly Asp Gly Val Gly Glu His
2120 2125 2130 



Pro Thr Gln Ala Leu Leu Asp Ile Phe Thr Ile Arg Glu Glu Leu
2135 2140 2145 



Gly Thr Val Asn Gly Met Thr Ile Thr Met Val Gly Asp Leu Lys
2150 2155 2160 



His Gly Arg Thr Val His Ser Leu Ala Cys Leu Leu Thr Gln Tyr
2165 2170 2175 



Arg Val Ser Leu Arg Tyr Val Ala Pro Pro Ser Leu Arg Met Pro 
2180 2185 2190 



Pro Thr Val Arg Ala Phe Val Ala Ser Arg Gly Thr Lys Gln Glu
2195 2200 2205 



Glu Phe Glu Ser Ile Glu Glu Ala Leu Pro Asp Thr Asp Val Leu
2210 2215 2220 



Tyr Met Thr Arg Ile Gln Lys Glu Arg Phe Gly Ser Thr Gln Glu
2225 2230 2235 



Tyr Glu Ala Cys Phe Gly Gln Phe Ile Leu Thr Pro His Ile Met
2240 2245 2250 



Thr Arg Ala Lys Lys Lys Met Val Val Met His Pro Met Pro Arg 
2255 2260 2265 



Val Asn Glu Ile Ser Val Glu Val Asp Ser Asp Pro Arg Ala Ala
2270 2275 2280



Tyr Phe Arg Gln Ala Glu Asn Gly Met Tyr Ile Arg Met Ala Leu
2285 2290 2295 
Leu Ala Thr Val Leu Gly Arg Phe
2300 2305 



<210> 128 

<211> 1651 

<212> PRT 

<213> Homo sapien 



<400> 128 

Pro Pro Pro Leu Ser Ser Leu Pro Met Ala Ala Leu Val Leu Glu Asp 
1 5 10 15



Gly Ser Val Leu Arg Gly Gln Pro Phe Gly Ala Ala Val Ser Thr Ala
20 25 30 



Gly Glu Val Val Phe Gln Thr Gly Met Val Gly Tyr Pro Glu Ala Leu
35 40 45 



Thr Asp Pro Ser Tyr Lys Ala Gln Ile Leu Val Leu Thr Tyr Pro Leu
50 55 60 



Ile Gly Asn Tyr Gly Ile Pro Pro Asp Glu Met Asp Glu Phe Gly Leu
65 70 75 80



Cys Lys Trp Phe Glu Ser Ser Gly Ile His Val Ala Ala Leu Val Val
85 90 95 



Gly Glu Cys Cys Pro Thr Pro Ser His Trp Ser Ala Thr Arg Thr Leu 
100 105 110



His Glu Trp Leu Gln Gln His Gly Ile Pro Gly Leu Gln Gly Val Asp
115 120 125 



Thr Arg Glu Leu Thr Lys Lys Leu Arg Glu Gln Gly Ser Leu Leu Gly
130 135 140 



Lys Leu Val Gln Asn Gly Thr Glu Pro Ser Ser Leu Pro Phe Leu Asp
145 150 155 160



Pro Asn Ala Arg Pro Leu Val Pro Glu Val Ser Ile Lys Thr Pro Arg
165 170 175 



Val Phe Asn Thr Gly Gly Ala Pro Arg Ile Leu Ala Leu Asp Cys Gly
180 185 190 
Leu Lys Tyr Asn Gln Ile Arg Cys Leu Cys Gln Arg Gly Ala Glu Val
195 200 205 



Thr Val Val Pro Trp Asp His Ala Leu Asp Ser Gln Glu Tyr Glu Gly
210 215 220 



Leu Phe Leu Ser Asn Gly Pro Gly Asp Pro Ala Ser Tyr Pro Ser Val 
225 230 235 240 



Val Ser Thr Leu Ser Arg Val Leu Ser Glu Pro Asn Pro Arg Pro Val 
245 250 255 



Phe Gly Ile Cys Leu Gly His Gln Leu Leu Ala Leu Ala Ile Gly Ala
260 265 270 



Lys Thr Tyr Lys Met Arg Tyr Gly Asn Arg Gly His Asn Gln Pro Cys
275 280 285 



Leu Leu Val Gly Ser Gly Arg Cys Phe Leu Thr Ser Gln Asn His Gly
290 295 300 



Phe Ala Val Glu Thr Asp Ser Leu Pro Ala Asp Trp Ala Pro Leu Phe 
305 310 315 320



Thr Asn Ala Asn Asp Gly Ser Asn Glu Gly Ile Val His Asn Ser Leu
325 330 335 



Pro Phe Phe Ser Val Gln Phe His Pro Glu His Gln Ala Gly Pro Ser
340 345 350 



Asp Met Glu Leu Leu Phe Asp Ile Phe Leu Glu Thr Val Lys Glu Ala
355 360 365 



Thr Ala Gly Asn Pro Gly Gly Gln Thr Val Arg Glu Arg Leu Thr Glu
370 375 380 



Arg Leu Cys Pro Pro Gly Ile Pro Thr Pro Gly Ser Gly Leu Pro Pro
385 390 395 400 



Pro Arg Lys Val Leu Ile Leu Gly Ser Gly Gly Leu Ser Ile Gly Gln
405 410 415 



Ala Gly Glu Phe Asp Tyr Ser Gly Ser Gln Ala Ile Lys Ala Leu Lys
420 425 430



Glu Glu Asn Ile Gln Thr Leu Leu Ile Asn Pro Asn Ile Ala Thr Val
435 440 445



Gln Thr Ser Gln Gly Leu Ala Asp Lys Val Tyr Phe Leu Pro Ile Thr
450 455 460 



Pro His Tyr Val Thr Gln Val Ile Arg Asn Glu Arg Pro Asp Gly Val
465 470 475 480



Leu Leu Thr Phe Gly Gly Gln Thr Ala Leu Asn Cys Gly Val Glu Leu
485 490 495 



Thr Lys Ala Gly Val Leu Ala Arg Tyr Gly Val Arg Val Leu Gly Thr 
500 505 510 



Thr Val Glu Thr Ile Glu Leu Thr Glu Asp Arg Arg Ala Phe Ala Ala
515 520 525 



Arg Met Ala Glu Ile Gly Glu His Val Ala Pro Ser Glu Ala Gly Asn
530 535 540 



Ser Leu Glu Gln Ala Gln Ala Ala Ala Glu Arg Leu Gly Tyr Pro Val
545 550 555 560 



Leu Val Arg Ala Ala Phe Ala Val Gly Gly Leu Gly Ser Gly Phe Ala 
565 570 575 



Ser Asn Arg Glu Glu Leu Ser Ala Leu Val Ala Pro Ala Phe Ala His 
580 585 590 



Thr Ser Gln Val Leu Val Asp Lys Ser Leu Lys Gly Trp Lys Glu Ile
595 600 605 



Glu Tyr Glu Val Val Arg Asp Ala Tyr Gly Asn Cys Val Thr Val Cys 
610 615 620 



Asn Met Glu Asn Leu Asp Pro Leu Gly Ile His Thr Gly Glu Ser Ile
625 630 635 640 



Val Val Ala Pro Ser Gln Thr Leu Asn Asp Arg Glu Tyr Gln Leu Leu
645 650 655 



Arg Gln Thr Ala Ile Lys Val Thr Gln His Leu Gly Ile Val Gly Glu
660 665 670 



Cys Asn Val Gln Tyr Ala Leu Asn Pro Glu Ser Glu Gln Tyr Tyr Ile
675 680 685 
Ile Glu Val Asn Ala Arg Leu Ser Arg Ser Ser Ala Leu Ala Ser Lys
690 695 700 



Ala Thr Gly Tyr Pro Leu Ala Tyr Val Ala Ala Lys Leu Ala Leu Gly 
705 710 715 720 



Ile Pro Leu Pro Glu Leu Arg Asn Ser Val Thr Gly Gly Thr Ala Ala
725 730 735 



Phe Glu Pro Ser Val Asp Tyr Cys Val Val Lys Ile Pro Arg Trp Asp
740 745 750 



Leu Ser Lys Phe Leu Arg Val Ser Thr Lys Ile Gly Ser Cys Met Lys
755 760 765 



Ser Val Gly Glu Val Met Gly Ile Gly Arg Ser Phe Glu Glu Ala Phe
770 775 780 



Gln Lys Ala Leu Arg Met Val Asp Glu Asn Cys Val Gly Phe Asp His
785 790 795 800 



Thr Val Lys Pro Val Ser Asp Met Glu Leu Glu Thr Pro Thr Asp Lys 
805 810 815 



Arg Ile Phe Val Val Ala Ala Ala Leu Trp Ala Gly Tyr Ser Val Asp
820 825 830 



Arg Leu Tyr Glu Leu Thr Arg Ile Asp Arg Trp Phe Leu His Arg Met
835 840 845 



Lys Arg Ile Ile Ala His Ala Gln Leu Leu Glu Gln His Arg Gly Gln
850 855 860 



Pro Leu Pro Pro Asp Leu Leu Gln Gln Ala Lys Cys Leu Gly Phe Ser
865 870 875 880 



Asp Lys Gln Ile Ala Leu Ala Val Leu Ser Thr Glu Leu Ala Val Arg
885 890 895 



Lys Leu Arg Gln Glu Leu Gly Ile Cys Pro Ala Val Lys Gln Ile Asp
900 905 910 



Thr Val Ala Ala Glu Trp Pro Ala Gln Thr Asn Tyr Leu Tyr Leu Thr
915 920 925 
Tyr Trp Gly Thr Thr His Asp Leu Thr Phe Arg Thr Pro His Val Leu
930 935 940 



Val Leu Gly Ser Gly Val Tyr Arg Ile Gly Ser Ser Val Glu Phe Asp
945 950 955 960



Trp Cys Ala Val Gly Cys Ile Gln Gln Leu Arg Lys Met Gly Tyr Lys
965 970 975 



Thr Ile Met Val Asn Tyr Asn Pro Glu Thr Val Ser Thr Asp Tyr Asp
980 985 990 



Met Cys Asp Arg Leu Tyr Phe Asp Glu Ile Ser Phe Glu Val Val Met
995 1000 1005 



Asp Ile Tyr Glu Leu Glu Asn Pro Glu Gly Val Ile Leu Ser Met
1010 1015 1020 



Gly Gly Gln Leu Pro Asn Asn Met Ala Met Ala Leu His Arg Gln
1025 1030 1035 



Gln Cys Arg Val Leu Gly Thr Ser Pro Glu Ala Ile Asp Ser Ala
1040 1045 1050 



Glu Asn Arg Phe Lys Phe Ser Arg Leu Leu Asp Thr Ile Gly Ile
1055 1060 1065 



Ser Gln Pro Gln Trp Arg Glu Leu Ser Asp Leu Glu Ser Ala Arg
1070 1075 1080 



Gln Phe Cys Gln Thr Val Gly Tyr Pro Cys Val Val Arg Pro Ser
1085 1090 1095 



Tyr Val Leu Ser Gly Ala Ala Met Asn Val Ala Tyr Thr Asp Gly 
1100 1105 1110 



Asp Leu Glu Arg Phe Leu Ser Ser Ala Ala Ala Val Ser Lys Glu 
1115 1120 1125 



His Pro Val Val Ile Ser Lys Phe Ile Gln Glu Ala Lys Glu Ile
1130 1135 1140



Asp Val Asp Ala Val Ala Ser Asp Gly Val Val Ala Ala Ile Ala
1145 1150 1155 
Ile Ser Glu His Val Glu Asn Ala Gly Val His Ser Gly Asp Ala
1160 1165 1170 



Thr Leu Val Thr Pro Pro Gln Asp Ile Thr Ala Lys Thr Leu Glu
1175 1180 1185 



Arg Ile Lys Ala Ile Val His Ala Val Gly Gln Glu Leu Gln Val
1190 1195 1200 



Thr Gly Pro Phe Asn Leu Gln Leu Ile Ala Lys Asp Asp Gln Leu
1205 1210 1215 



Lys Val Ile Glu Cys Asn Val Arg Val Ser Arg Ser Phe Pro Phe
1220 1225 1230 



Val Ser Lys Thr Leu Gly Val Asp Leu Val Ala Leu Ala Thr Arg 
1235 1240 1245 



Val Ile Met Gly Glu Glu Val Glu Pro Val Gly Leu Met Thr Gly
1250 1255 1260 



Ser Gly Val Val Gly Val Lys Val Pro Gln Phe Ser Phe Ser Arg
1265 1270 1275 



Leu Ala Gly Ala Asp Val Val Leu Gly Val Glu Met Thr Ser Thr 
1280 1285 1290 



Gly Glu Val Ala Gly Phe Gly Glu Ser Arg Cys Glu Ala Tyr Leu 
1295 1300 1305 



Lys Ala Met Leu Ser Thr Gly Phe Lys Ile Pro Lys Lys Asn Ile
1310 1315 1320 



Leu Leu Thr Ile Gly Ser Tyr Lys Asn Lys Ser Glu Leu Leu Pro
1325 1330 1335 



Thr Val Arg Leu Leu Glu Ser Leu Gly Tyr Ser Leu Tyr Ala Ser 
1340 1345 1350 



Leu Gly Thr Ala Asp Phe Tyr Thr Glu His Gly Val Lys Val Thr 
1355 1360 1365 



Ala Val Asp Trp His Phe Glu Glu Ala Val Asp Gly Glu Cys Pro 
1370 1375 1380 



Pro Gln Arg Ser Ile Leu Glu Gln Leu Ala Glu Lys Asn Phe Glu
1385 1390 1395



Leu Val Ile Asn Leu Ser Met Arg Gly Ala Gly Gly Arg Arg Leu
1400 1405 1410 



Ser Ser Phe Val Thr Lys Gly Tyr Arg Thr Arg Arg Leu Ala Ala 
1415 1420 1425 



Asp Phe Ser Val Pro Leu Ile Ile Asp Ile Lys Cys Thr Lys Leu
1430 1435 1440 



Phe Val Glu Ala Leu Gly Gln Ile Gly Pro Ala Pro Pro Leu Lys
1445 1450 1455 



Val His Val Asp Cys Met Thr Ser Gln Lys Leu Val Arg Leu Pro
1460 1465 1470 



Gly Leu Ile Asp Val His Val His Leu Arg Glu Pro Gly Gly Thr
1475 1480 1485 



His Lys Glu Asp Phe Ala Ser Gly Thr Ala Ala Ala Leu Ala Gly 
1490 1495 1500 



Gly Ile Thr Met Val Cys Ala Met Pro Asn Thr Arg Pro Pro Ile
1505 1510 1515 



Ile Asp Ala Pro Ala Leu Ala Leu Ala Gln Lys Leu Ala Glu Ala
1520 1525 1530 



Gly Ala Arg Cys Asp Phe Ala Leu Phe Leu Gly Ala Ser Ser Glu 
1535 1540 1545 



Asn Ala Gly Thr Leu Gly Thr Val Ala Gly Ser Ala Ala Gly Leu 
1550 1555 1560 



Lys Leu Tyr Leu Asn Glu Thr Phe Ser Glu Leu Arg Leu Asp Ser 
1565 1570 1575 



Val Val Gln Trp Met Glu His Phe Glu Thr Trp Pro Ser His Leu
1580 1585 1590 



Pro Ile Val Ala His Ala Glu Gln Gln Thr Val Ala Ala Val Leu
1595 1600 1605 



Met Val Ala Gln Leu Thr Gln Arg Ser Val His Ile Cys His Val
1610 1615 1620 
Ala Arg Lys Glu Glu Val Arg Val His Leu Arg Ser Cys Cys Pro
1625 1630 1635



Cys Cys Phe Pro Ser Asn Thr Lys Gly Gln Gly Ser Pro
1640 1645 1650 



<210> 129 

<211> 2242 

<212> PRT 

<213> Homo sapien 

<400> 129 

Met Ala Ala Leu Val Leu Glu Asp Gly Ser Val Leu Arg Gly Gln Pro
1 5 10 15



Phe Gly Ala Ala Val Ser Thr Ala Gly Glu Val Val Phe Gln Thr Gly
20 25 30 



Met Val Gly Tyr Pro Glu Ala Leu Thr Asp Pro Ser Tyr Lys Ala Gln
35 40 45 



Ile Leu Val Leu Thr Tyr Pro Leu Ile Gly Asn Tyr Gly Ile Pro Pro
50 55 60 



Asp Glu Met Asp Glu Phe Gly Leu Cys Lys Trp Phe Glu Ser Ser Gly 
65 70 75 80 



Ile His Val Ala Ala Leu Val Val Gly Glu Cys Cys Pro Thr Pro Ser
85 90 95 



His Trp Ser Ala Thr Arg Thr Leu His Glu Trp Leu Gln Gln His Gly
100 105 110 



Ile Pro Gly Leu Gln Gly Val Asp Thr Arg Glu Leu Thr Lys Lys Leu
115 120 125 



Arg Glu Gln Gly Ser Leu Leu Gly Lys Leu Val Gln Asn Gly Thr Glu
130 135 140 



Pro Ser Ser Leu Pro Phe Leu Asp Pro Asn Ala Arg Pro Leu Val Pro 
145 150 155 160 



Glu Val Ser Ile Lys Thr Pro Arg Val Phe Asn Thr Gly Gly Ala Pro
165 170 175 
Arg Ile Leu Ala Leu Asp Cys Gly Leu Lys Tyr Asn Gln Ile Arg Cys
180 185 190 



Leu Cys Gln Arg Gly Ala Glu Val Thr Val Val Pro Trp Asp His Ala
195 200 205 



Leu Asp Ser Gln Glu Tyr Glu Gly Leu Phe Leu Ser Asn Gly Pro Gly
210 215 220 



Asp Pro Ala Ser Tyr Pro Ser Val Val Ser Thr Leu Ser Arg Val Leu 
225 230 235 240 



Ser Glu Pro Asn Pro Arg Pro Val Phe Gly Ile Cys Leu Gly His Gln
245 250 255 



Leu Leu Ala Leu Ala Ile Gly Ala Lys Thr Tyr Lys Met Arg Tyr Gly
260 265 270 



Asn Arg Gly His Asn Gln Pro Cys Leu Leu Val Gly Ser Gly Arg Cys
275 280 285 



Phe Leu Thr Ser Gln Asn His Gly Phe Ala Val Glu Thr Asp Ser Leu
290 295 300 



Pro Ala Asp Trp Ala Pro Leu Phe Thr Asn Ala Asn Asp Gly Ser Asn 
305 310 315 320 



Glu Gly Ile Val His Asn Ser Leu Pro Phe Phe Ser Val Gln Phe His
325 330 335 



Pro Glu His Gln Ala Gly Pro Ser Asp Met Glu Leu Leu Phe Asp Ile
340 345 350 



Phe Leu Glu Thr Val Lys Glu Ala Thr Ala Gly Asn Pro Gly Gly Gln
355 360 365 



Thr Val Arg Glu Arg Leu Thr Glu Arg Leu Cys Pro Pro Gly Ile Pro
370 375 380 



Thr Pro Gly Ser Gly Leu Pro Pro Pro Arg Lys Val Leu Ile Leu Gly
385 390 395 400 



Ser Gly Gly Leu Ser Ile Gly Gln Ala Gly Glu Phe Asp Tyr Ser Gly
405 410 415 



Ser Gln Ala Ile Lys Ala Leu Lys Glu Glu Asn Ile Gln Thr Leu Leu
420 425 430



Ile Asn Pro Asn Ile Ala Thr Val Gln Thr Ser Gln Gly Leu Ala Asp
435 440 445



Lys Val Tyr Phe Leu Pro Ile Thr Pro His Tyr Val Thr Gln Val Ile
450 455 460 



Arg Asn Glu Arg Pro Asp Gly Val Leu Leu Thr Phe Gly Gly Gln Thr
465 470 475 480 



Ala Leu Asn Cys Gly Val Glu Leu Thr Lys Ala Gly Val Leu Ala Arg 
485 490 495 



Tyr Gly Val Arg Val Leu Gly Thr Thr Val Glu Thr Ile Glu Leu Thr
500 505 510 



Glu Asp Arg Arg Ala Phe Ala Ala Arg Met Ala Glu Ile Gly Glu His
515 520 525 



Val Ala Pro Ser Glu Ala Gly Asn Ser Leu Glu Gln Ala Gln Ala Ala
530 535 540 



Ala Glu Arg Leu Gly Tyr Pro Val Leu Val Arg Ala Ala Phe Ala Val 
545 550 555 560 



Gly Gly Leu Gly Ser Gly Phe Ala Ser Asn Arg Glu Glu Leu Ser Ala 
565 570 575 



Leu Val Ala Pro Ala Phe Ala His Thr Ser Gln Val Leu Val Asp Lys
580 585 590 



Ser Leu Lys Gly Trp Lys Glu Ile Glu Tyr Glu Val Val Arg Asp Ala
595 600 605 



Tyr Gly Asn Cys Val Thr Val Cys Asn Met Glu Asn Leu Asp Pro Leu 
610 615 620 



Gly Ile His Thr Gly Glu Ser Ile Val Val Ala Pro Ser Gln Thr Leu
625 630 635 640 



Asn Asp Arg Glu Tyr Gln Leu Leu Arg Gln Thr Ala Ile Lys Val Thr
645 650 655 



Gln His Leu Gly Ile Val Gly Glu Cys Asn Val Gln Tyr Ala Leu Asn
660 665 670 
Pro Glu Ser Glu Gln Tyr Tyr Ile Ile Glu Val Asn Ala Arg Leu Ser
675 680 685 



Arg Ser Ser Ala Leu Ala Ser Lys Ala Thr Gly Tyr Pro Leu Ala Tyr 
690 695 700 



Val Ala Ala Lys Leu Ala Leu Gly Ile Pro Leu Pro Glu Leu Arg Asn
705 710 715 720 



Ser Val Thr Gly Gly Thr Ala Ala Phe Glu Pro Ser Val Asp Tyr Cys 
725 730 735 



Val Val Lys Ile Pro Arg Trp Asp Leu Ser Lys Phe Leu Arg Val Ser
740 745 750 



Thr Lys Ile Gly Ser Cys Met Lys Ser Val Gly Glu Val Met Gly Ile
755 760 765



Gly Arg Ser Phe Glu Glu Ala Phe Gln Lys Ala Leu Arg Met Val Asp
770 775 780 



Glu Asn Cys Val Gly Phe Asp His Thr Val Lys Pro Val Ser Asp Met 
785 790 795 800



Glu Leu Glu Thr Pro Thr Asp Lys Arg Ile Phe Val Val Ala Ala Ala
805 810 815 



Leu Trp Ala Gly Tyr Ser Val Asp Arg Leu Tyr Glu Leu Thr Arg Ile
820 825 830 



Asp Arg Trp Phe Leu His Arg Met Lys Arg Ile Ile Ala His Ala Gln
835 840 845 



Leu Leu Glu Gln His Arg Gly Gln Pro Leu Pro Pro Asp Leu Leu Gln
850 855 860 



Gln Ala Lys Cys Leu Gly Phe Ser Asp Lys Gln Ile Ala Leu Ala Val
865 870 875 880 



Leu Ser Thr Glu Leu Ala Val Arg Lys Leu Arg Gln Glu Leu Gly Ile
885 890 895 



Cys Pro Ala Val Lys Gln Ile Asp Thr Val Ala Ala Glu Trp Pro Ala
900 905 910 
Gln Thr Asn Tyr Leu Tyr Leu Thr Tyr Trp Gly Thr Thr His Asp Leu
915 920 925 



Thr Phe Arg Thr Pro His Val Leu Val Leu Gly Ser Gly Val Tyr Arg 
930 935 940 



Ile Gly Ser Ser Val Glu Phe Asp Trp Cys Ala Val Gly Cys Ile Gln
945 950 955 960 



Gln Leu Arg Lys Met Gly Tyr Lys Thr Ile Met Val Asn Tyr Asn Pro
965 970 975 



Glu Thr Val Ser Thr Asp Tyr Asp Met Cys Asp Arg Leu Tyr Phe Asp 
980 985 990 



Glu Ile Ser Phe Glu Val Val Met Asp Ile Tyr Glu Leu Glu Asn Pro
995 1000 1005 



Glu Gly Val Ile Leu Ser Met Gly Gly Gln Leu Pro Asn Asn Met
1010 1015 1020 



Ala Met Ala Leu His Arg Gln Gln Cys Arg Val Leu Gly Thr Ser
1025 1030 1035 



Pro Glu Ala Ile Asp Ser Ala Glu Asn Arg Phe Lys Phe Ser Arg
1040 1045 1050 



Leu Leu Asp Thr Ile Gly Ile Ser Gln Pro Gln Trp Arg Glu Leu
1055 1060 1065 



Ser Asp Leu Glu Ser Ala Arg Gln Phe Cys Gln Thr Val Gly Tyr
1070 1075 1080 



Pro Cys Val Val Arg Pro Ser Tyr Val Leu Ser Gly Ala Ala Met 
1085 1090 1095 



Asn Val Ala Tyr Thr Asp Gly Asp Leu Glu Arg Phe Leu Ser Ser 
1100 1105 1110 



Ala Ala Ala Val Ser Lys Glu His Pro Val Val Ile Ser Lys Phe
1115 1120 1125 



Ile Gln Glu Ala Lys Glu Ile Asp Val Asp Ala Val Ala Ser Asp
1130 1135 1140 
Gly Val Val Ala Ala Ile Ala Ile Ser Glu His Val Glu Asn Ala
1145 1150 1155



Gly Val His Ser Gly Asp Ala Thr Leu Val Thr Pro Pro Gln Asp
1160 1165 1170



Ile Thr Ala Lys Thr Leu Glu Arg Ile Lys Ala Ile Val His Ala
1175 1180 1185



Val Gly Gln Glu Leu Gln Val Thr Gly Pro Phe Asn Leu Gln Leu
1190 1195 1200



Ile Ala Lys Asp Asp Gln Leu Lys Val Ile Glu Cys Asn Val Arg
1205 1210 1215 



Val Ser Arg Ser Phe Pro Phe Val Ser Lys Thr Leu Gly Val Asp 
1220 1225 1230



Leu Val Ala Leu Ala Thr Arg Val Ile Met Gly Glu Glu Val Glu
1235 1240 1245 



Pro Val Gly Leu Met Thr Gly Ser Gly Val Val Gly Val Lys Val 
1250 1255 1260 



Pro Gln Phe Ser Phe Ser Arg Leu Ala Gly Ala Asp Val Val Leu
1265 1270 1275 



Gly Val Glu Met Thr Ser Thr Gly Glu Val Ala Gly Phe Gly Glu 
1280 1285 1290 



Ser Arg Cys Glu Ala Tyr Leu Lys Ala Met Leu Ser Thr Gly Phe 
1295 1300 1305 



Lys Ile Pro Lys Lys Asn Ile Leu Leu Thr Ile Gly Ser Tyr Lys
1310 1315 1320 



Asn Lys Ser Glu Leu Leu Pro Thr Val Arg Leu Leu Glu Ser Leu 
1325 1330 1335 



Gly Tyr Ser Leu Tyr Ala Ser Leu Gly Thr Ala Asp Phe Tyr Thr 
1340 1345 1350 



Glu His Gly Val Lys Val Thr Ala Val Asp Trp His Phe Glu Glu 
1355 1360 1365 



Ala Val Asp Gly Glu Cys Pro Pro Gln Arg Ser Ile Leu Glu Gln
1370 1375 1380



Leu Ala Glu Lys Asn Phe Glu Leu Val Ile Asn Leu Ser Met Arg
1385 1390 1395 



Gly Ala Gly Gly Arg Arg Leu Ser Ser Phe Val Thr Lys Gly Tyr 
1400 1405 1410 



Arg Thr Arg Arg Leu Ala Ala Asp Phe Ser Val Pro Leu Ile Ile
1415 1420 1425 



Asp Ile Lys Cys Thr Lys Leu Phe Val Glu Ala Leu Gly Gln Ile
1430 1435 1440 



Gly Pro Ala Pro Pro Leu Lys Val His Val Asp Cys Met Thr Ser 
1445 1450 1455 



Gln Lys Leu Val Arg Leu Pro Gly Leu Ile Asp Val His Val His
1460 1465 1470 



Leu Arg Glu Pro Gly Gly Thr His Lys Glu Asp Phe Ala Ser Gly 
1475 1480 1485 



Thr Ala Ala Ala Leu Ala Gly Gly Ile Thr Met Val Cys Ala Met
1490 1495 1500 



Pro Asn Thr Arg Pro Pro Ile Ile Asp Ala Pro Ala Leu Ala Leu
1505 1510 1515 



Ala Gln Lys Leu Ala Glu Ala Gly Ala Arg Cys Asp Phe Ala Leu
1520 1525 1530 



Phe Leu Gly Ala Ser Ser Glu Asn Ala Gly Thr Leu Gly Thr Val 
1535 1540 1545 



Ala Gly Ser Ala Ala Gly Leu Lys Leu Tyr Leu Asn Glu Thr Phe 
1550 1555 1560 



Ser Glu Leu Arg Leu Asp Ser Val Val Gln Trp Met Glu His Phe
1565 1570 1575 



Glu Thr Trp Pro Ser His Leu Pro Ile Val Ala His Ala Glu Gln
1580 1585 1590 



Gln Thr Val Ala Ala Val Leu Met Val Ala Gln Leu Thr Gln Arg
1595 1600 1605 
Ser Val His Ile Cys His Val Ala Arg Lys Glu Glu Ile Leu Leu
1610 1615 1620 



Ile Lys Ala Ala Lys Ala Arg Gly Leu Pro Val Thr Cys Glu Val
1625 1630 1635 



Ala Pro His His Leu Phe Leu Ser His Asp Asp Leu Glu Arg Leu 
1640 1645 1650 



Gly Pro Gly Lys Gly Glu Val Arg Pro Glu Leu Gly Ser Arg Gln
1655 1660 1665 



Asp Val Glu Ala Leu Trp Glu Asn Met Ala Val Ile Asp Cys Phe
1670 1675 1680 



Ala Ser Asp His Ala Pro His Thr Leu Glu Glu Lys Cys Gly Ser 
1685 1690 1695 



Arg Pro Pro Pro Gly Phe Pro Gly Leu Glu Thr Met Leu Pro Leu 
1700 1705 1710 



Leu Leu Thr Ala Val Ser Glu Gly Arg Leu Ser Leu Asp Asp Leu 
1715 1720 1725 



Leu Gln Arg Leu His His Asn Pro Arg Arg Ile Phe His Leu Pro
1730 1735 1740 



Pro Gln Glu Asp Thr Tyr Val Glu Val Asp Leu Glu His Glu Trp
1745 1750 1755 



Thr Ile Pro Ser His Met Pro Phe Ser Lys Ala His Trp Thr Pro
1760 1765 1770 



Phe Glu Gly Gln Lys Val Lys Gly Thr Val Arg Arg Val Val Leu
1775 1780 1785



Arg Gly Glu Val Ala Tyr Ile Asp Gly Gln Val Leu Val Pro Pro
1790 1795 1800 



Gly Tyr Gly Gln Asp Val Arg Lys Trp Pro Gln Gly Ala Val Pro
1805 1810 1815 



Gln Leu Pro Pro Ser Ala Pro Ala Thr Ser Glu Met Thr Thr Thr
1820 1825 1830 
Pro Glu Arg Pro Arg Arg Gly Ile Pro Gly Leu Pro Asp Gly Arg
1835 1840 1845 



Phe His Leu Pro Pro Arg Ile His Arg Ala Ser Asp Pro Gly Leu
1850 1855 1860 



Pro Ala Val Phe Leu Arg Pro Gly Ala Gly Ile Pro Arg Gly Ser
1865 1870 1875 



Arg Ala Trp Ala Glu Glu Pro Lys Glu Lys Ser Ser Arg Lys Val 
1880 1885 1890 



Ala Glu Pro Glu Leu Met Gly Thr Pro Asp Gly Thr Cys Tyr Pro 
1895 1900 1905 



Pro Pro Pro Val Pro Arg Gln Ala Ser Pro Gln Asn Leu Gly Thr
1910 1915 1920 



Pro Gly Leu Leu His Pro Gln Thr Ser Pro Leu Leu His Ser Leu
1925 1930 1935 



Val Gly Gln His Ile Leu Ser Val Gln Gln Phe Thr Lys Asp Gln
1940 1945 1950 



Met Ser His Leu Phe Asn Val Ala His Thr Leu Arg Met Met Val 
1955 1960 1965 



Gln Lys Glu Arg Ser Leu Asp Ile Leu Lys Gly Lys Val Met Ala
1970 1975 1980 



Ser Met Phe Tyr Glu Val Ser Thr Arg Thr Ser Ser Ser Phe Ala 
1985 1990 1995 



Ala Ala Met Ala Arg Leu Gly Gly Ala Val Leu Ser Phe Ser Glu 
2000 2005 2010 



Ala Thr Ser Ser Val Gln Lys Gly Glu Ser Leu Ala Asp Ser Val
2015 2020 2025 



Gln Thr Met Ser Cys Tyr Ala Asp Val Val Val Leu Arg His Pro
2030 2035 2040 



Gln Pro Gly Ala Val Glu Leu Ala Ala Lys His Cys Arg Arg Pro
2045 2050 2055 
Val Ile Asn Ala Gly Asp Gly Val Gly Glu His Pro Thr Gln Ala
2060 2065 2070 



Leu Leu Asp Ile Phe Thr Ile Arg Glu Glu Leu Gly Thr Val Asn
2075 2080 2085 



Gly Met Thr Ile Thr Met Val Gly Asp Leu Lys His Gly Arg Thr
2090 2095 2100 



Val His Ser Leu Ala Cys Leu Leu Thr Gln Tyr Arg Val Ser Leu
2105 2110 2115 



Arg Tyr Val Ala Pro Pro Ser Leu Arg Met Pro Pro Thr Val Arg
2120 2125 2130

Ala Phe Val Ala Ser Arg Gly Thr Lys Gln Glu Glu Phe Glu Ser
2135 2140 2145

Ile Glu Glu Ala Leu Pro Asp Thr Asp Val Leu Tyr Met Thr Arg
2150 2155 2160

Ile Gln Lys Glu Arg Phe Gly Ser Thr Gln Glu Tyr Glu Ala Cys
2165 2170 2175

Phe Gly Gln Phe Ile Leu Thr Pro His Ile Met Thr Arg Ala Lys
2180 2185 2190

Lys Lys Met Val Val Met His Pro Met Pro Arg Val Asn Glu Ile
2195 2200 2205

Ser Val Glu Val Asp Ser Asp Pro Arg Ala Ala Tyr Phe Arg Gln
2210 2215 2220

Ala Glu Asn Gly Met Tyr Ile Arg Met Ala Leu Leu Ala Thr Val
2225 2230 2235

Leu Gly Arg Phe
2240



<210> 130 

<211> 371 

<212> PRT 

<213> Homo sapien 

<400> 130 

Met Tyr Ile Cys Pro Leu Val Leu Phe Ala Ala Glu Glu Pro Lys Glu
1 5 10 15

Lys Ser Ser Arg Lys Val Ala Glu Pro Glu Leu Met Gly Thr Pro Asp
20 25 30

Gly Thr Cys Tyr Pro Pro Pro Pro Val Pro Arg Gln Ala Ser Pro Gln
35 40 45

Asn Leu Gly Thr Pro Gly Leu Leu His Pro Gln Thr Ser Pro Leu Leu
50 55 60

His Ser Leu Val Gly Gln His Ile Leu Ser Val Gln Gln Phe Thr Lys
65 70 75 80

Asp Gln Met Ser His Leu Phe Asn Val Ala His Thr Leu Arg Met Met
85 90 95

Val Gln Lys Glu Arg Ser Leu Asp Ile Leu Lys Gly Lys Val Met Ala
100 105 110

Ser Met Phe Tyr Glu Val Ser Thr Arg Thr Ser Ser Ser Phe Ala Ala
115 120 125

Ala Met Ala Arg Leu Gly Gly Ala Val Leu Ser Phe Ser Glu Ala Thr
130 135 140

Ser Ser Val Gln Lys Gly Glu Ser Leu Ala Asp Ser Val Gln Thr Met
145 150 155 160

Ser Cys Tyr Ala Asp Val Val Val Leu Arg His Pro Gln Pro Gly Ala
165 170 175

Val Glu Leu Ala Ala Lys His Cys Arg Arg Pro Val Ile Asn Ala Gly
180 185 190



Asp Gly Val Gly Glu His Pro Thr Gln Ala Leu Leu Asp Ile Phe Thr
195 200 205

Ile Arg Glu Glu Leu Gly Thr Val Asn Gly Met Thr Ile Thr Met Val
210 215 220

Gly Asp Leu Lys His Gly Arg Thr Val His Ser Leu Ala Cys Leu Leu
225 230 235 240

Thr Gln Tyr Arg Val Ser Leu Arg Tyr Val Ala Pro Pro Ser Leu Arg
245 250 255

Met Pro Pro Thr Val Arg Ala Phe Val Ala Ser Arg Gly Thr Lys Gln
260 265 270

Glu Glu Phe Glu Ser Ile Glu Glu Ala Leu Pro Asp Thr Asp Val Leu
275 280 285

Tyr Met Thr Arg Ile Gln Lys Glu Arg Phe Gly Ser Thr Gln Glu Tyr
290 295 300

Glu Ala Cys Phe Gly Gln Phe Ile Leu Thr Pro His Ile Met Thr Arg
305 310 315 320

Ala Lys Lys Lys Met Val Val Met His Pro Met Pro Arg Val Asn Glu
325 330 335

Ile Ser Val Glu Val Asp Ser Asp Pro Arg Ala Ala Tyr Phe Arg Gln
340 345 350

Ala Glu Asn Gly Met Tyr Ile Arg Met Ala Leu Leu Ala Thr Val Leu
355 360 365

Gly Arg Phe
370



<210> 131 

<211> 970 

<212> PRT 

<213> Homo sapien 

<400> 131 

Met Ala Gly Ala Leu Ala Gly Leu Ala Ala Gly Leu Gln Val Pro Arg
1 5 10 15

Val Ala Pro Ser Pro Asp Ser Asp Ser Asp Thr Asp Ser Glu Asp Pro
20 25 30

Ser Leu Arg Arg Ser Ala Gly Gly Leu Leu Arg Ser Gln Val Ile His
35 40 45

Ser Gly His Phe Met Val Ser Ser Pro His Ser Asp Ser Leu Pro Arg
50 55 60

Arg Arg Asp Gln Glu Gly Ser Val Gly Pro Ser Asp Phe Gly Pro Arg
65 70 75 80

Ser Ile Asp Pro Thr Leu Thr Arg Leu Phe Glu Cys Leu Ser Leu Ala
85 90 95

Tyr Ser Gly Lys Leu Val Ser Pro Lys Trp Lys Asn Phe Lys Gly Leu
100 105 110

Lys Leu Leu Cys Arg Asp Lys Ile Arg Leu Asn Asn Ala Ile Trp Arg
115 120 125

Ala Trp Tyr Ile Gln Tyr Val Lys Arg Arg Lys Ser Pro Val Cys Gly
130 135 140

Phe Val Thr Pro Leu Gln Gly Pro Glu Ala Asp Ala His Arg Lys Pro
145 150 155 160

Glu Ala Val Val Leu Glu Gly Asn Tyr Trp Lys Arg Arg Ile Glu Val
165 170 175



Val Met Arg Glu Tyr His Lys Trp Arg Ile Tyr Tyr Lys Lys Arg Val
180 185 190

Ser Gly Gly Gly Pro Gly Arg Pro Gln Ser Phe Pro Pro Ala Ala Ala
195 200 205

Gly Tyr Arg Pro Pro Arg Lys Ile Pro Gly Lys Gly Ile Leu Thr Pro
210 215 220

Glu Leu Ala Pro Leu Gly Pro Ser Ile Gln Ser Arg Ala Asp Ser Ala
225 230 235 240

Thr Val Trp Pro Gln Arg Leu Leu Ala Ala Ser Leu Pro Arg Gly Arg
245 250 255

Leu Arg Lys Pro Ser Arg Glu Asp Asp Leu Leu Ala Pro Lys Gln Ala
260 265 270

Glu Gly Arg Trp Pro Pro Pro Glu Gln Trp Cys Lys Gln Leu Phe Ser
275 280 285

Ser Val Val Pro Val Leu Leu Gly Asp Pro Glu Glu Glu Pro Gly Gly
290 295 300

Arg Gln Leu Leu Asp Leu Asn Cys Phe Leu Ser Asp Ile Ser Asp Thr
305 310 315 320

Leu Phe Thr Met Thr Gln Ser Gly Pro Ser Pro Leu Gln Leu Pro Pro
325 330 335

Glu Asp Ala Tyr Val Gly Asn Ala Asp Met Ile Gln Pro Asp Leu Thr
340 345 350

Pro Leu Gln Pro Ser Leu Asp Asp Phe Met Asp Ile Ser Asp Phe Phe
355 360 365

Thr Asn Ser Arg Leu Pro Gln Pro Pro Met Pro Ser Asn Phe Pro Glu
370 375 380



Pro Pro Ser Phe Ser Pro Val Val Asp Ser Leu Phe Ser Ser Gly Thr 
385 390 395 400 



Leu Gly Pro Glu Val Pro Pro Ala Ser Ser Ala Met Thr His Leu Ser 
405 410 415 



Gly His Ser Arg Leu Gln Ala Arg Asn Ser Cys Pro Gly Pro Leu Asp
420 425 430

Ser Ser Ala Phe Leu Ser Ser Asp Phe Leu Leu Pro Glu Asp Pro Lys
435 440 445

Pro Arg Leu Pro Pro Pro Pro Val Pro Pro Pro Leu Leu His Tyr Pro
450 455 460

Pro Pro Ala Lys Val Pro Gly Leu Glu Pro Cys Pro Pro Pro Pro Phe
465 470 475 480

Pro Pro Met Ala Pro Pro Thr Ala Leu Leu Gln Glu Glu Pro Leu Phe
485 490 495

Ser Pro Arg Phe Pro Phe Pro Thr Val Pro Pro Ala Pro Gly Val Ser
500 505 510

Pro Leu Pro Ala Pro Ala Ala Phe Pro Pro Thr Pro Gln Ser Val Pro
515 520 525

Ser Pro Ala Pro Thr Pro Phe Pro Ile Glu Leu Leu Pro Leu Gly Tyr
530 535 540

Ser Glu Pro Ala Phe Gly Pro Cys Phe Ser Met Pro Arg Gly Lys Pro
545 550 555 560

Pro Ala Pro Ser Pro Arg Gly Gln Lys Ala Ser Pro Pro Thr Leu Ala
565 570 575

Pro Ala Thr Ala Ser Pro Pro Thr Thr Ala Gly Ser Asn Asn Pro Cys
580 585 590

Leu Thr Gln Leu Leu Thr Ala Ala Lys Pro Glu Gln Ala Leu Glu Pro
595 600 605

Pro Leu Val Ser Ser Thr Leu Leu Arg Ser Pro Gly Ser Pro Gln Glu
610 615 620



Thr Val Pro Glu Phe Pro Cys Thr Phe Leu Pro Pro Thr Pro Ala Pro 
625 630 635 640 



Thr Pro Pro Arg Pro Pro Pro Gly Pro Ala Thr Leu Ala Pro Ser Arg 
645 650 655 



Pro Leu Leu Val Pro Lys Ala Glu Arg Leu Ser Pro Pro Ala Pro Ser 
660 665 670 



Gly Lys Glu Arg Gly Cys Arg Asp Trp Gly Asn Leu Gly Asp Glu Gly 
675 680 685 



Lys Arg Arg Lys Lys Lys Gly Thr Gly Ser Gly Ala Arg Trp Glu Glu 
690 695 700 



Gly Gly Asn Gly Asp Pro Phe Pro Trp Val Gly Leu Ser Val Gly Leu 
705 710 715 720 



Ser Val Pro Thr Thr Gly Ser Glu Arg Arg Leu Ser Gly Asp Leu Ser
725 730 735

Ser Met Pro Gly Pro Gly Thr Leu Ser Val Arg Val Ser Pro Pro Gln
740 745 750

Pro Ile Leu Ser Arg Gly Arg Pro Asp Ser Asn Lys Thr Glu Asn Arg
755 760 765

Arg Ile Thr His Ile Ser Ala Glu Gln Lys Arg Arg Phe Asn Ile Lys
770 775 780

Leu Gly Phe Asp Thr Leu His Gly Leu Val Ser Thr Leu Ser Ala Gln
785 790 795 800

Pro Ser Leu Lys Val Ser Lys Ala Thr Thr Leu Gln Lys Thr Ala Glu
805 810 815

Tyr Ile Leu Met Leu Gln Gln Glu Arg Ala Gly Leu Gln Glu Glu Ala
820 825 830



Gln Gln Leu Arg Asp Glu Ile Glu Glu Leu Asn Ala Ala Ile Asn Leu
835 840 845

Cys Gln Gln Gln Leu Pro Ala Thr Gly Val Pro Ile Thr His Gln Arg
850 855 860

Phe Asp Gln Met Arg Asp Met Phe Asp Asp Tyr Val Arg Thr Arg Thr
865 870 875 880

Leu His Asn Trp Lys Phe Trp Val Phe Ser Ile Leu Ile Arg Pro Leu
885 890 895

Phe Glu Ser Phe Asn Gly Met Val Ser Thr Ala Ser Val His Thr Leu
900 905 910

Arg Gln Thr Ser Leu Ala Trp Leu Asp Gln Tyr Cys Ser Leu Pro Ala
915 920 925

Leu Arg Pro Thr Val Leu Asn Ser Leu Arg Gln Leu Gly Thr Ser Thr
930 935 940

Ser Ile Leu Thr Asp Pro Gly Arg Ile Pro Glu Gln Ala Thr Arg Ala
945 950 955 960



Val Thr Glu Gly Thr Leu Gly Lys Pro Leu 
965 970 



<210> 132 

<211> 719 

<212> PRT 

<213> Homo sapien 

<400> 132 

Ala Val Pro Arg Ala Ala Arg Ser Arg Asp Gln Ala Val Ala Ala Ala
1 5 10 15

Thr Ala Met Ala Gly Ala Leu Ala Gly Leu Ala Ala Gly Leu Gln Val
20 25 30

Pro Arg Val Ala Pro Ser Pro Asp Ser Asp Ser Asp Thr Asp Ser Glu
35 40 45

Asp Pro Ser Leu Arg Arg Ser Ala Gly Gly Leu Leu Arg Ser Gln Val
50 55 60

Ile His Ser Gly His Phe Met Val Ser Ser Pro His Ser Asp Ser Leu
65 70 75 80

Pro Arg Arg Arg Asp Gln Glu Gly Ser Val Gly Pro Ser Asp Phe Gly
85 90 95

Pro Arg Ser Ile Asp Pro Thr Leu Thr Arg Leu Phe Glu Cys Leu Ser
100 105 110

Leu Ala Tyr Ser Gly Lys Leu Val Ser Pro Lys Trp Lys Asn Phe Lys
115 120 125

Gly Leu Lys Leu Leu Cys Arg Asp Lys Ile Arg Leu Asn Asn Ala Ile
130 135 140

Trp Arg Ala Trp Tyr Ile Gln Tyr Val Lys Arg Arg Lys Ser Pro Val
145 150 155 160

Cys Gly Phe Val Thr Pro Leu Gln Gly Pro Glu Ala Asp Ala His Arg
165 170 175

Lys Pro Glu Ala Val Val Leu Glu Gly Asn Tyr Trp Lys Arg Arg Ile
180 185 190



Glu Val Val Met Arg Glu Tyr His Lys Trp Arg Ile Tyr Tyr Lys Lys
195 200 205

Arg Val Ser Gly Gly Gly Pro Gly Arg Pro Gln Ser Phe Pro Pro Ala
210 215 220

Ala Ala Gly Tyr Arg Pro Pro Arg Lys Ile Pro Gly Lys Gly Ile Leu
225 230 235 240

Thr Pro Glu Leu Ala Pro Leu Gly Pro Ser Ile Gln Ser Arg Ala Asp
245 250 255

Ser Ala Thr Val Trp Pro Gln Arg Leu Leu Ala Ala Ser Leu Pro Arg
260 265 270

Gly Arg Leu Arg Lys Pro Ser Arg Glu Asp Asp Leu Leu Ala Pro Lys
275 280 285

Gln Ala Glu Gly Arg Trp Pro Pro Pro Glu Gln Trp Cys Lys Gln Leu
290 295 300

Phe Ser Ser Val Val Pro Val Leu Leu Gly Asp Pro Glu Glu Glu Pro
305 310 315 320

Gly Gly Arg Gln Leu Leu Asp Leu Asn Cys Phe Leu Ser Asp Ile Ser
325 330 335

Asp Thr Leu Phe Thr Met Thr Gln Ser Gly Pro Ser Pro Leu Gln Leu
340 345 350

Pro Pro Glu Asp Ala Tyr Val Gly Asn Ala Asp Met Ile Gln Pro Asp
355 360 365

Leu Thr Pro Leu Gln Pro Ser Leu Asp Asp Phe Met Asp Ile Ser Asp
370 375 380

Phe Phe Thr Asn Ser Arg Leu Pro Gln Pro Pro Met Pro Ser Asn Phe
385 390 395 400



Pro Glu Pro Pro Ser Phe Ser Pro Val Val Asp Ser Leu Phe Ser Ser 
405 410 415 



Gly Thr Leu Gly Pro Glu Val Pro Pro Ala Ser Ser Ala Met Thr His 
420 425 430 



Leu Ser Gly His Ser Arg Leu Gln Ala Arg Asn Ser Cys Pro Gly Pro
435 440 445

Leu Asp Ser Ser Ala Phe Leu Ser Ser Asp Phe Leu Leu Pro Glu Asp
450 455 460

Pro Lys Pro Arg Leu Pro Pro Pro Pro Val Pro Pro Pro Leu Leu His
465 470 475 480

Tyr Pro Pro Pro Ala Lys Val Pro Gly Leu Glu Pro Cys Pro Pro Pro
485 490 495

Pro Phe Pro Pro Met Ala Pro Pro Thr Ala Leu Leu Gln Glu Glu Pro
500 505 510

Leu Phe Ser Pro Arg Phe Pro Phe Pro Thr Val Pro Pro Ala Pro Gly
515 520 525

Val Ser Pro Leu Pro Ala Pro Ala Ala Phe Pro Pro Thr Pro Gln Ser
530 535 540

Val Pro Ser Pro Ala Pro Thr Pro Phe Pro Ile Glu Leu Leu Pro Leu
545 550 555 560

Gly Tyr Ser Glu Pro Ala Phe Gly Pro Cys Phe Ser Met Pro Arg Gly
565 570 575

Lys Pro Pro Ala Pro Ser Pro Arg Gly Gln Lys Ala Ser Pro Pro Thr
580 585 590



Leu Ala Pro Ala Thr Ala Ser Pro Pro Thr Thr Ala Gly Ser Asn Asn 
595 600 605 



Pro Cys Leu Thr Gln Leu Leu Thr Ala Ala Lys Pro Glu Gln Ala Leu
610 615 620

Glu Pro Pro Leu Val Ser Ser Thr Leu Leu Arg Ser Pro Gly Ser Pro
625 630 635 640

Gln Glu Thr Val Pro Glu Phe Pro Cys Thr Phe Leu Pro Pro Thr Pro
645 650 655

Ala Pro Thr Pro Pro Arg Pro Pro Pro Gly Pro Ala Thr Leu Ala Pro
660 665 670

Ser Arg Pro Leu Leu Val Pro Lys Ala Glu Arg Leu Ser Pro Pro Ala
675 680 685



Pro Ser Gly Lys Glu Arg Gly Cys Arg Asp Trp Gly Asn Leu Gly Asp 
690 695 700 



705 710 715




<210> 133

<211> 629

<212> PRT

<213> Homo sapien

<400> 133

Met Ala Val Pro Trp Glu Gln Ala Ser Leu Gly Gly Gln Arg Pro Cys
1 5 10 15

Leu Gly Thr Gln Arg Pro Ala Thr Asp Phe Phe Thr Asn Ser Arg Leu
20 25 30

Pro Gln Pro Pro Met Pro Ser Asn Phe Pro Glu Pro Pro Ser Phe Ser
35 40 45

Pro Val Val Asp Ser Leu Phe Ser Ser Gly Thr Leu Gly Pro Glu Val
50 55 60

Pro Pro Ala Ser Ser Ala Met Thr His Leu Ser Gly His Ser Arg Leu
65 70 75 80

Gln Ala Arg Asn Ser Cys Pro Gly Pro Leu Asp Ser Ser Ala Phe Leu
85 90 95



Ser Ser Asp Phe Leu Leu Pro Glu Asp Pro Lys Pro Arg Leu Pro Pro
100 105 110

Pro Pro Val Pro Pro Pro Leu Leu His Tyr Pro Pro Pro Ala Lys Val
115 120 125

Pro Gly Leu Glu Pro Cys Pro Pro Pro Pro Phe Pro Pro Met Ala Pro
130 135 140

Pro Thr Ala Leu Leu Gln Glu Glu Pro Leu Phe Ser Pro Arg Phe Pro
145 150 155 160

Phe Pro Thr Val Pro Pro Ala Pro Gly Val Ser Pro Leu Pro Ala Pro
165 170 175

Ala Ala Phe Pro Pro Thr Pro Gln Ser Val Pro Ser Pro Ala Pro Thr
180 185 190

Pro Phe Pro Ile Glu Leu Leu Pro Leu Gly Tyr Ser Glu Pro Ala Phe
195 200 205

Gly Pro Cys Phe Ser Met Pro Arg Gly Lys Pro Pro Ala Pro Ser Pro
210 215 220

Arg Gly Gln Lys Ala Ser Pro Pro Thr Leu Ala Pro Ala Thr Ala Ser
225 230 235 240

Pro Pro Thr Thr Ala Gly Ser Asn Asn Pro Cys Leu Thr Gln Leu Leu
245 250 255

Thr Ala Ala Lys Pro Glu Gln Ala Leu Glu Pro Pro Leu Val Ser Ser
260 265 270

Thr Leu Leu Arg Ser Pro Gly Ser Pro Gln Glu Thr Val Pro Glu Phe
275 280 285

Pro Cys Thr Phe Leu Pro Pro Thr Pro Ala Pro Thr Pro Pro Arg Pro
290 295 300

Pro Pro Gly Pro Ala Thr Leu Ala Pro Ser Arg Pro Leu Leu Val Pro
305 310 315 320

Lys Ala Glu Arg Leu Ser Pro Pro Ala Pro Ser Gly Lys Glu Arg Gly
325 330 335



Cys Arg Asp Trp Gly Asn Leu Gly Asp Glu Gly Lys Arg Arg Lys Lys 
340 345 350 



Lys Gly Thr Gly Ser Gly Ala Arg Gly Glu Glu Gly Gly Asn Gly Asp 
355 360 365 



Pro Phe Pro Trp Val Gly Leu Ser Val Gly Leu Ser Val Pro Thr Thr
370 375 380

Gly Ser Glu Arg Arg Leu Ser Gly Asp Leu Ser Ser Met Pro Gly Pro
385 390 395 400

Gly Thr Leu Ser Val Arg Val Ser Pro Pro Gln Pro Ile Leu Ser Arg
405 410 415

Gly Arg Pro Asp Ser Asn Lys Thr Glu Asn Arg Arg Ile Thr His Ile
420 425 430

Ser Ala Glu Gln Lys Arg Arg Phe Asn Ile Lys Leu Gly Phe Asp Thr
435 440 445

Leu His Gly Leu Val Ser Thr Leu Ser Ala Gln Pro Ser Leu Lys Val
450 455 460

Ser Lys Ala Thr Thr Leu Gln Lys Thr Ala Glu Tyr Ile Leu Met Leu
465 470 475 480

Gln Gln Glu Arg Ala Gly Leu Gln Glu Glu Ala Gln Gln Leu Arg Asp
485 490 495

Glu Ile Glu Glu Leu Asn Ala Ala Ile Asn Leu Cys Gln Gln Gln Leu
500 505 510

Pro Ala Thr Gly Val Pro Ile Thr His Gln Arg Phe Asp Gln Met Arg
515 520 525

Asp Met Phe Asp Asp Tyr Val Arg Thr Arg Thr Leu His Asn Trp Lys
530 535 540



Phe Trp Val Phe Ser Ile Leu Ile Arg Pro Leu Phe Glu Ser Phe Asn
545 550 555 560

Gly Met Val Ser Thr Ala Ser Val His Thr Leu Arg Gln Thr Ser Leu
565 570 575

Ala Trp Leu Asp Gln Tyr Cys Ser Leu Pro Ala Leu Arg Pro Thr Val
580 585 590

Leu Asn Ser Leu Arg Gln Leu Gly Thr Ser Thr Ser Ile Leu Thr Asp
595 600 605

Pro Gly Arg Ile Pro Glu Gln Ala Thr Arg Ala Val Thr Glu Gly Thr
610 615 620

Leu Gly Lys Pro Leu
625




<210> 134

<211> 382

<212> PRT

<213> Homo sapien

<400> 134

Gly Thr Thr Pro Thr Pro Val Pro Val Pro Gly Gly Gly Gly Gly Cys
1 5 10 15

Pro Gly Ala Glu Val Cys Met Ala Val Pro Trp Glu Gln Ala Ser Leu
20 25 30

Gly Gly Gln Arg Pro Cys Leu Gly Thr Gln Arg Pro Ala Thr Asp Phe
35 40 45

Phe Thr Asn Ser Arg Leu Pro Gln Pro Pro Met Pro Ser Asn Phe Pro
50 55 60

Glu Pro Pro Ser Phe Ser Pro Val Val Asp Ser Leu Phe Ser Ser Gly
65 70 75 80

Thr Leu Gly Pro Glu Val Pro Pro Ala Ser Ser Ala Met Thr His Leu
85 90 95



Ser Gly His Ser Arg Leu Gln Ala Arg Asn Ser Cys Pro Gly Pro Leu
100 105 110

Asp Ser Ser Ala Phe Leu Ser Ser Asp Phe Leu Leu Pro Glu Asp Pro
115 120 125

Lys Pro Arg Leu Pro Pro Pro Pro Val Pro Pro Pro Leu Leu His Tyr
130 135 140

Pro Pro Pro Ala Lys Val Pro Gly Leu Glu Pro Cys Pro Pro Pro Pro
145 150 155 160

Phe Pro Pro Met Ala Pro Pro Thr Ala Leu Leu Gln Glu Glu Pro Leu
165 170 175

Phe Ser Pro Arg Phe Pro Phe Pro Thr Val Pro Pro Ala Pro Gly Val
180 185 190

Ser Pro Leu Pro Ala Pro Ala Ala Phe Pro Pro Thr Pro Gln Ser Val
195 200 205

Pro Ser Pro Ala Pro Thr Pro Phe Pro Ile Glu Leu Leu Pro Leu Gly
210 215 220

Tyr Ser Glu Pro Ala Phe Gly Pro Cys Phe Ser Met Pro Arg Gly Lys
225 230 235 240

Pro Pro Ala Pro Ser Pro Arg Gly Gln Lys Ala Ser Pro Pro Thr Leu
245 250 255

Ala Pro Ala Thr Ala Ser Pro Pro Thr Thr Ala Gly Ser Asn Asn Pro
260 265 270

Cys Leu Thr Gln Leu Leu Thr Ala Ala Lys Pro Glu Gln Ala Leu Glu
275 280 285



Pro Pro Leu Val Ser Ser Thr Leu Leu Arg Ser Pro Gly Ser Pro Gln
290 295 300

Glu Thr Val Pro Glu Phe Pro Cys Thr Phe Leu Pro Pro Thr Pro Ala
305 310 315 320

Pro Thr Pro Pro Arg Pro Pro Pro Gly Pro Ala Thr Leu Ala Pro Ser
325 330 335

Arg Pro Leu Leu Val Pro Lys Ala Glu Arg Leu Ser Pro Pro Ala Pro
340 345 350

Ser Gly Lys Glu Arg Gly Cys Arg Asp Trp Gly Asn Leu Gly Asp Glu
355 360 365



Gly Lys Arg Arg Lys Lys Lys Gly Thr Gly Ser Gly Ala Arg 
370 375 380 



<210> 135 

<211> 1009 

<212> PRT 

<213> Homo sapien 

<400> 135 

Met Ala Gly Ala Leu Ala Gly Leu Ala Ala Gly Leu Gln Val Pro Arg
1 5 10 15

Val Ala Pro Ser Pro Asp Ser Asp Ser Asp Thr Asp Ser Glu Asp Pro
20 25 30

Ser Leu Arg Arg Ser Ala Gly Gly Leu Leu Arg Ser Gln Val Ile His
35 40 45

Ser Gly His Phe Met Val Ser Ser Pro His Ser Asp Ser Leu Pro Arg
50 55 60

Arg Arg Asp Gln Glu Gly Ser Val Gly Pro Ser Asp Phe Gly Pro Arg
65 70 75 80

Ser Ile Asp Pro Thr Leu Thr Arg Leu Phe Glu Cys Leu Ser Leu Ala
85 90 95

Tyr Ser Gly Lys Leu Val Ser Pro Lys Trp Lys Asn Phe Lys Gly Leu
100 105 110

Lys Leu Leu Cys Arg Asp Lys Ile Arg Leu Asn Asn Ala Ile Trp Arg
115 120 125

Ala Trp Tyr Ile Gln Tyr Val Lys Arg Arg Lys Ser Pro Val Cys Gly
130 135 140

Phe Val Thr Pro Leu Gln Gly Pro Glu Ala Asp Ala His Arg Lys Pro
145 150 155 160

Glu Ala Val Val Leu Glu Gly Asn Tyr Trp Lys Arg Arg Ile Glu Val
165 170 175

Val Met Arg Glu Tyr His Lys Trp Arg Ile Tyr Tyr Lys Lys Arg Val
180 185 190



Ser Gly Gly Gly Pro Gly Arg Pro Gln Ser Phe Pro Pro Ala Ala Ala
195 200 205

Gly Tyr Arg Pro Pro Arg Lys Ile Pro Gly Lys Gly Ile Leu Thr Pro
210 215 220

Glu Leu Ala Pro Leu Gly Pro Ser Ile Gln Ser Arg Ala Asp Ser Ala
225 230 235 240

Thr Val Trp Pro Gln Arg Leu Leu Ala Ala Ser Leu Pro Arg Gly Arg
245 250 255

Val Ser Ser Cys Gly Lys Trp Gly Ile Ser Ser Leu Val Ser Arg Gly
260 265 270

Asp Phe Gly Gly Lys Val Met Glu Asp Gly Ala Lys Gly Ser Gly Val
275 280 285

Gln Leu Cys Val Ser Leu Gln Leu Arg Lys Pro Ser Arg Glu Asp Asp
290 295 300

Leu Leu Ala Pro Lys Gln Ala Glu Gly Arg Trp Pro Pro Pro Glu Gln
305 310 315 320

Trp Cys Lys Gln Leu Phe Ser Ser Val Val Pro Val Leu Leu Gly Asp
325 330 335

Pro Glu Glu Glu Pro Gly Gly Arg Gln Leu Leu Asp Leu Asn Cys Phe
340 345 350

Leu Ser Asp Ile Ser Asp Thr Leu Phe Thr Met Thr Gln Ser Gly Pro
355 360 365

Ser Pro Leu Gln Leu Pro Pro Glu Asp Ala Tyr Val Gly Asn Ala Asp
370 375 380

Met Ile Gln Pro Asp Leu Thr Pro Leu Gln Pro Ser Leu Asp Asp Phe
385 390 395 400



Met Asp Ile Ser Asp Phe Phe Thr Asn Ser Arg Leu Pro Gln Pro Pro
405 410 415

Met Pro Ser Asn Phe Pro Glu Pro Pro Ser Phe Ser Pro Val Val Asp
420 425 430

Ser Leu Phe Ser Ser Gly Thr Leu Gly Pro Glu Val Pro Pro Ala Ser
435 440 445

Ser Ala Met Thr His Leu Ser Gly His Ser Arg Leu Gln Ala Arg Asn
450 455 460

Ser Cys Pro Gly Pro Leu Asp Ser Ser Ala Phe Leu Ser Ser Asp Phe
465 470 475 480

Leu Leu Pro Glu Asp Pro Lys Pro Arg Leu Pro Pro Pro Pro Val Pro
485 490 495

Pro Pro Leu Leu His Tyr Pro Pro Pro Ala Lys Val Pro Gly Leu Glu
500 505 510

Pro Cys Pro Pro Pro Pro Phe Pro Pro Met Ala Pro Pro Thr Ala Leu
515 520 525

Leu Gln Glu Glu Pro Leu Phe Ser Pro Arg Phe Pro Phe Pro Thr Val
530 535 540

Pro Pro Ala Pro Gly Val Ser Pro Leu Pro Ala Pro Ala Ala Phe Pro
545 550 555 560

Pro Thr Pro Gln Ser Val Pro Ser Pro Ala Pro Thr Pro Phe Pro Ile
565 570 575

Glu Leu Leu Pro Leu Gly Tyr Ser Glu Pro Ala Phe Gly Pro Cys Phe
580 585 590

Ser Met Pro Arg Gly Lys Pro Pro Ala Pro Ser Pro Arg Gly Gln Lys
595 600 605

Ala Ser Pro Pro Thr Leu Ala Pro Ala Thr Ala Ser Pro Pro Thr Thr
610 615 620

Ala Gly Ser Asn Asn Pro Cys Leu Thr Gln Leu Leu Thr Ala Ala Lys
625 630 635 640



Pro Glu Gln Ala Leu Glu Pro Pro Leu Val Ser Ser Thr Leu Leu Arg
645 650 655

Ser Pro Gly Ser Pro Gln Glu Thr Val Pro Glu Phe Pro Cys Thr Phe
660 665 670

Leu Pro Pro Thr Pro Ala Pro Thr Pro Pro Arg Pro Pro Pro Gly Pro
675 680 685



Ala Thr Leu Ala Pro Ser Arg Pro Leu Leu Val Pro Lys Ala Glu Arg 
690 695 700 



Leu Ser Pro Pro Ala Pro Ser Gly Lys Glu Arg Gly Cys Arg Asp Trp 
705 710 715 720 



Gly Asn Leu Gly Asp Glu Gly Lys Arg Arg Lys Lys Lys Gly Thr Gly 
725 730 735 



Ser Gly Ala Arg Trp Glu Glu Gly Gly Asn Gly Asp Pro Phe Pro Trp 
740 745 750 



Val Gly Leu Ser Val Gly Leu Ser Val Pro Thr Thr Gly Ser Glu Arg 
755 760 765 



Arg Leu Ser Gly Asp Leu Ser Ser Met Pro Gly Pro Gly Thr Leu Ser 
770 775 780 



Val Arg Val Ser Pro Pro Gln Pro Ile Leu Ser Arg Gly Arg Pro Asp
785 790 795 800

Ser Asn Lys Thr Glu Asn Arg Arg Ile Thr His Ile Ser Ala Glu Gln
805 810 815

Lys Arg Arg Phe Asn Ile Lys Leu Gly Phe Asp Thr Leu His Gly Leu
820 825 830

Val Ser Thr Leu Ser Ala Gln Pro Ser Leu Lys Val Ser Lys Ala Thr
835 840 845

Thr Leu Gln Lys Thr Ala Glu Tyr Ile Leu Met Leu Gln Gln Glu Arg
850 855 860

Ala Gly Leu Gln Glu Glu Ala Gln Gln Leu Arg Asp Glu Ile Glu Glu
865 870 875 880

Leu Asn Ala Ala Ile Asn Leu Cys Gln Gln Gln Leu Pro Ala Thr Gly
885 890 895



Val Pro Ile Thr His Gln Arg Phe Asp Gln Met Arg Asp Met Phe Asp
900 905 910

Asp Tyr Val Arg Thr Arg Thr Leu His Asn Trp Lys Phe Trp Val Phe
915 920 925

Ser Ile Leu Ile Arg Pro Leu Phe Glu Ser Phe Asn Gly Met Val Ser
930 935 940

Thr Ala Ser Val His Thr Leu Arg Gln Thr Ser Leu Ala Trp Leu Asp
945 950 955 960

Gln Tyr Cys Ser Leu Pro Ala Leu Arg Pro Thr Val Leu Asn Ser Leu
965 970 975

Arg Gln Leu Gly Thr Ser Thr Ser Ile Leu Thr Asp Pro Gly Arg Ile
980 985 990

Pro Glu Gln Ala Thr Arg Ala Val Thr Glu Gly Thr Leu Gly Lys Pro
995 1000 1005



Leu 



<210> 136 

<211> 469 

<212> PRT 

<213> Homo sapien 

<400> 136 

Arg Arg Leu Trp Gly Glu Ser Asp Gly Gly Trp Gly Lys Gly Ser Gly
1 5 10 15

Val Gln Leu Cys Val Ser Leu Gln Leu Arg Lys Pro Ser Arg Glu Asp
20 25 30

Asp Leu Leu Ala Pro Lys Gln Ala Glu Gly Arg Trp Pro Pro Pro Glu
35 40 45

Gln Trp Cys Lys Gln Leu Phe Ser Ser Val Val Pro Val Leu Leu Gly
50 55 60

Asp Pro Glu Glu Glu Pro Gly Gly Arg Gln Leu Leu Asp Leu Asn Cys
65 70 75 80

Phe Leu Ser Asp Ile Ser Asp Thr Leu Phe Thr Met Thr Gln Ser Gly
85 90 95

Pro Ser Pro Leu Gln Leu Pro Pro Glu Asp Ala Tyr Val Gly Asn Ala
100 105 110

Asp Met Ile Gln Pro Asp Leu Thr Pro Leu Gln Pro Ser Leu Asp Asp
115 120 125

Phe Met Asp Ile Ser Asp Phe Phe Thr Asn Ser Arg Leu Pro Gln Pro
130 135 140

Pro Met Pro Ser Asn Phe Pro Glu Pro Pro Ser Phe Ser Pro Val Val
145 150 155 160

Asp Ser Leu Phe Ser Ser Gly Thr Leu Gly Pro Glu Val Pro Pro Ala
165 170 175

Ser Ser Ala Met Thr His Leu Ser Gly His Ser Arg Leu Gln Ala Arg
180 185 190



Asn Ser Cys Pro Gly Pro Leu Asp Ser Ser Ala Phe Leu Ser Ser Asp 
195 200 205 



Phe Leu Leu Pro Glu Asp Pro Lys Pro Arg Leu Pro Pro Pro Pro Val 
210 215 220 



Pro Pro Pro Leu Leu His Tyr Pro Pro Pro Ala Lys Val Pro Gly Leu 
225 230 235 240 



Glu Pro Cys Pro Pro Pro Pro Phe Pro Pro Met Ala Pro Pro Thr Ala 
245 250 255 



Leu Leu Gln Glu Glu Pro Leu Phe Ser Pro Arg Phe Pro Phe Pro Thr
260 265 270

Val Pro Pro Ala Pro Gly Val Ser Pro Leu Pro Ala Pro Ala Ala Phe
275 280 285

Pro Pro Thr Pro Gln Ser Val Pro Ser Pro Ala Pro Thr Pro Phe Pro
290 295 300

Ile Glu Leu Leu Pro Leu Gly Tyr Ser Glu Pro Ala Phe Gly Pro Cys
305 310 315 320

Phe Ser Met Pro Arg Gly Lys Pro Pro Ala Pro Ser Pro Arg Gly Gln
325 330 335

Lys Ala Ser Pro Pro Thr Leu Ala Pro Ala Thr Ala Ser Pro Pro Thr
340 345 350

Thr Ala Gly Ser Asn Asn Pro Cys Leu Thr Gln Leu Leu Thr Ala Ala
355 360 365

Lys Pro Glu Gln Ala Leu Glu Pro Pro Leu Val Ser Ser Thr Leu Leu
370 375 380

Arg Ser Pro Gly Ser Pro Gln Glu Thr Val Pro Glu Phe Pro Cys Thr
385 390 395 400



Phe Leu Pro Pro Thr Pro Ala Pro Thr Pro Pro Arg Pro Pro Pro Gly 
405 410 415 



Pro Ala Thr Leu Ala Pro Ser Arg Pro Leu Leu Val Pro Lys Ala Glu 
420 425 430 



Arg Leu Ser Pro Pro Ala Pro Ser Gly Lys Glu Arg Gly Cys Arg Asp 
435 440 445 



Trp Gly Asn Leu Gly Asp Glu Gly Lys Arg Arg Lys Lys Lys Gly Thr 
450 455 460 



Gly Ser Gly Ala Arg
465




<210> 137

<211> 191

<212> PRT

<213> Homo sapien

<400> 137


Val Leu Pro 


1 





10 15 



Pro Thr Val Ser Pro Tyr Pro His Pro Gly Pro Gly Trp Ser Pro Ala 
20 25 30 



His Arg Gly Ala Thr Gly Gly Asn Gly Pro Asp Ser Gly Ala Thr Asp 
35 40 45 



Ser Pro Ala Val Gly Asp Asp Asp Arg Glu Gln Val Ala Gly Pro Lys
50 55 60

Gly Lys Ala Pro Pro Val Pro Ala Pro Ala Arg Glu Ser Gly Asn Arg
65 70 75 80

Ser Ala Arg Pro Leu His Ser Leu Ser Val Leu Ala Phe Asp Gln Glu
85 90 95

Arg Leu Glu Arg Lys Ile Leu Ala Leu Arg Gln Ala Arg Arg Pro Val
100 105 110

Pro Pro Glu Val Ala Gln Gln Tyr Gln Asp Ile Met Gln Arg Ser Gln
115 120 125

Trp Gln Arg Ala Gln Leu Glu Gln Gly Gly Val Gly Ile Arg Arg Glu
130 135 140

Tyr Ala Ala Gln Leu Glu Arg Gln Leu Gln Phe Tyr Thr Glu Ala Ala
145 150 155 160

Arg Arg Leu Gly Asn Asp Gly Ser Arg Asp Ala Ala Lys Glu Ala Leu
165 170 175

Tyr Arg Arg Asn Leu Val Glu Ser Glu Leu Gln Arg Leu Arg Arg
180 185 190



<210> 138 

<211> 117 

<212> PRT 

<213> Homo sapien 

<400> 138 

Glu Gly Leu Gly Asp Gly His Leu Glu Gly Glu Gly Leu Gly Asp Gly 
1 5 10 15



His Leu Glu Gly Glu Gly Leu Gly Asp Gly His Leu Glu Gly Glu Gly 
20 25 30 



Leu Gly Asp Gly His Leu Ala Gly Asp Gly Leu Gly Asp Gly His Ser 
35 40 45 



Pro Gly Gly Leu Trp Ala Val Arg Pro His Pro Lys Pro Pro Phe Pro 
50 55 60 



Arg His Pro Phe Ser Glu Tyr Ala Ala Gln Leu Glu Arg Gln Leu Gln
65 70 75 80 



Phe Tyr Thr Glu Ala Ala Arg Arg Leu Gly Asn Asp Gly Ser Arg Asp 
85 90 95 



Ala Ala Lys Glu Ala Leu Tyr Arg Arg Asn Leu Val Glu Ser Glu Leu 
100 105 110

Gln Arg Leu Arg Arg
115


<210> 139

<211> 165

<212> PRT

<213> Homo sapien

<400> 139



Arg Val Ser Cys Ser Gly Ser Ala Gly Glu Glu Pro Met Gly Arg Ala 
1 5 10 15 



Ala Pro Arg Lys Arg Ala Ala Gly Pro Asp Thr Gly Lys Ser Arg His 
20 25 30 



Ser His Glu Pro Asp Lys Gln Thr Ile Ser Gly Gln Ser Val Leu Asp
35 40 45

Ser Pro Leu Ile Arg Ala Pro Ser Pro Ala Arg Ala Ser Val Ala Ala
50 55 60

Gly Val Gly Asn His Ala Cys Gln Pro Val Cys Ala Pro His Pro Gly
65 70 75 80

Leu Ala Gly Pro Trp Arg Val Leu Phe Ala Gln Pro Arg Gly Val Arg
85 90 95



Pro Leu Ala Arg Pro Gly Ala Gly Arg Val Ala Gly Ala Lys Pro Arg 
100 105 110 



Gly Pro Leu Gln Ala Leu Tyr Phe Leu Phe Leu Pro Ser Leu Asn Pro
115 120 125 



Lys Ala Leu Leu His Pro Lys Glu Ala Thr Glu Ala Gly Arg Ala Thr 
130 135 140 



Leu Ser Pro Gln Gly Arg Arg Pro Gly Pro Ala Gly Ser Pro Gly Pro
145 150 155 160

Ala His Gly Ile Lys
165 



<210> 140 

<211> 146 

<212> PRT 

<213> Homo sapien

<400> 140

Glu Gly Leu Gly Asp Gly His Leu Glu Gly Glu Gly Leu Gly Asp Gly
1 5 10 15



His Leu Glu Gly Glu Gly Leu Gly Asp Gly His Leu Glu Gly Glu Gly 
20 25 30 



Leu Gly Asp Gly His Leu Ala Gly Asp Gly Leu Gly Asp Gly His Ser 
35 40 45 



Pro Gly Gly Leu Trp Ala Val Arg Pro His Pro Lys Pro Pro Phe Pro 
50 55 60 



Arg His Pro Phe Ser Glu Tyr Ala Ala Gln Leu Glu Arg Gln Leu Gln
65 70 75 80 



Phe Tyr Thr Glu Ala Ala Arg Arg Leu Gly Asn Asp Gly Ser Arg Val 
85 90 95 



Ser Trp Ser Arg Ala Gly Trp Ala Leu Gly Ser Gly Gln Gly Gly Ala
100 105 110

Cys Arg Asp Tyr Leu Leu Asn Ala His Pro Pro Gln Asp Ala Ala Lys
115 120 125

Glu Ala Leu Tyr Arg Arg Asn Leu Val Glu Ser Glu Leu Gln Arg Leu
130 135 140

Arg Arg
145




<210> 141

<211> 165

<212> PRT

<213> Homo sapien

<400> 141

Arg Val Ser Cys Ser Gly Ser Ala Gly Glu Glu Pro Met Gly Arg Ala
1 5 10 15



Ala Pro Arg Lys Arg Ala Ala Gly Pro Asp Thr Gly Lys Ser Arg His 
20 25 30 



Ser His Glu Pro Asp Lys Gln Thr Ile Ser Gly Gln Ser Val Leu Asp
35 40 45

Ser Pro Leu Ile Arg Ala Pro Ser Pro Ala Arg Ala Ser Val Ala Ala
50 55 60

Gly Val Gly Asn His Ala Cys Gln Pro Val Cys Ala Pro His Pro Gly
65 70 75 80

Leu Ala Gly Pro Trp Arg Val Leu Phe Ala Gln Pro Arg Gly Val Arg
85 90 95



Pro Leu Ala Arg Pro Gly Ala Gly Arg Val Ala Gly Ala Lys Pro Arg 
100 105 110 



Gly Pro Leu Gln Ala Leu Tyr Phe Leu Phe Leu Pro Ser Leu Asn Pro
115 120 125 



Lys Ala Leu Leu His Pro Lys Glu Ala Thr Glu Ala Gly Arg Ala Thr 
130 135 140 



Leu Ser Pro Gln Gly Arg Arg Pro Gly Pro Ala Gly Ser Pro Gly Pro
145 150 155 160

Ala His Gly Ile Lys
165 



<210> 142 

<211> 636 

<212> PRT 

<213> Homo sapien 

<400> 142 

Met Thr Ser Ser Gly Pro Gly Pro Arg Phe Leu Leu Leu Leu Pro Leu
1 5 10 15

Leu Leu Pro Pro Ala Ala Ser Ala Ser Asp Arg Pro Arg Gly Arg Asp
20 25 30

Pro Val Asn Pro Glu Lys Leu Leu Val Ile Thr Val Ala Thr Ala Glu
35 40 45

Thr Glu Gly Tyr Leu Arg Phe Leu Arg Ser Ala Glu Phe Phe Asn Tyr
50 55 60

Thr Val Arg Thr Leu Gly Leu Gly Glu Glu Trp Arg Gly Gly Asp Val
65 70 75 80

Ala Arg Thr Val Gly Gly Gly Gln Lys Val Arg Trp Leu Lys Lys Glu
85 90 95

Met Glu Lys Tyr Ala Asp Arg Glu Asp Met Ile Ile Met Phe Val Asp
100 105 110

Ser Tyr Asp Val Ile Leu Ala Gly Ser Pro Thr Glu Leu Leu Lys Lys
115 120 125

Phe Val Gln Ser Gly Ser Arg Leu Leu Phe Ser Ala Glu Ser Phe Cys
130 135 140

Trp Pro Glu Trp Gly Leu Ala Glu Gln Tyr Pro Glu Val Gly Thr Gly
145 150 155 160

Lys Arg Phe Leu Asn Ser Gly Gly Phe Ile Gly Phe Ala Thr Thr Ile
165 170 175

His Gln Ile Val Arg Gln Trp Lys Tyr Lys Asp Asp Asp Asp Asp Gln
180 185 190



Leu Phe Tyr Thr Arg Leu Tyr Leu Asp Pro Gly Leu Arg Glu Lys Leu 
195 200 205 



Ser Leu Asn Leu Asp His Lys Ser Arg Ile Phe Gln Asn Leu Asn Gly
210 215 220

Ala Leu Asp Glu Val Val Leu Lys Phe Asp Arg Asn Arg Val Arg Ile
225 230 235 240

Arg Asn Val Ala Tyr Asp Thr Leu Pro Ile Val Val His Gly Asn Gly
245 250 255

Pro Thr Lys Leu Gln Leu Asn Tyr Leu Gly Asn Tyr Val Pro Asn Gly
260 265 270

Trp Thr Pro Glu Gly Gly Cys Gly Phe Cys Asn Gln Asp Arg Arg Thr
275 280 285

Leu Pro Gly Gly Gln Pro Pro Pro Arg Val Phe Leu Ala Val Phe Val
290 295 300

Glu Gln Pro Thr Pro Phe Leu Pro Arg Phe Leu Gln Arg Leu Leu Leu
305 310 315 320

Leu Asp Tyr Pro Pro Asp Arg Val Thr Leu Phe Leu His Asn Asn Glu
325 330 335

Val Phe His Glu Pro His Ile Ala Asp Ser Trp Pro Gln Leu Gln Asp
340 345 350

His Phe Ser Ala Val Lys Leu Val Gly Pro Glu Glu Ala Leu Ser Pro
355 360 365

Gly Glu Ala Arg Asp Met Ala Met Asp Leu Cys Arg Gln Asp Pro Glu
370 375 380



Cys Glu Phe Tyr Phe Ser Leu Asp Ala Asp Ala Val Leu Thr Asn Leu 
385 390 395 400 



Gln Thr Leu Arg Ile Leu Ile Glu Glu Asn Arg Lys Val Ile Ala Pro
405 410 415

Met Leu Ser Arg His Gly Lys Leu Trp Ser Asn Phe Trp Gly Ala Leu
420 425 430

Ser Pro Asp Glu Tyr Tyr Ala Arg Ser Glu Asp Tyr Val Glu Leu Val
435 440 445

Gln Arg Lys Arg Val Gly Val Trp Asn Val Pro Tyr Ile Ser Gln Ala
450 455 460

Tyr Val Ile Arg Gly Asp Thr Leu Arg Met Glu Leu Pro Gln Arg Asp
465 470 475 480

Val Phe Ser Gly Ser Asp Thr Asp Pro Asp Met Ala Phe Cys Lys Ser
485 490 495

Phe Arg Asp Lys Gly Ile Phe Leu His Leu Ser Asn Gln His Glu Phe
500 505 510

Gly Arg Leu Leu Ala Thr Ser Arg Tyr Asp Thr Glu His Leu His Pro
515 520 525

Asp Leu Trp Gln Ile Phe Asp Asn Pro Val Asp Trp Lys Glu Gln Tyr
530 535 540

Ile His Glu Asn Tyr Ser Arg Ala Leu Glu Gly Glu Gly Ile Val Glu
545 550 555 560



Gin Pro Cys Pro Asp Val Tyr Trp Phe Pro Leu Leu Ser Glu Gin Met 
565 570 575 
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Cys Asp Glu Leu Val Ala Glu Met Glu His Tyr Gly Gin Trp Ser Gly 
580 585 ' 590 



Gly Arg His Glu Thr Ala Leu Gln Glu Cys Leu Met Pro Val Glu Leu
595 600 605

Cys Pro Ala Val Ser Ser Ser His His Thr Ala Ser Lys Ala Ala Met
610 615 620

Ser Ser Leu Gly Ala Ile Glu Ala Thr Gly Lys Pro
625 630 635




<210> 143

<211> 682

<212> PRT

<213> Homo sapien

<400> 143



Met Thr Ser Ser Gly Pro Gly Pro Arg Phe Leu Leu Leu Leu Pro Leu 
1 5 10 15 



Leu Leu Pro Pro Ala Ala Ser Ala Ser Asp Arg Pro Arg Gly Arg Asp
20 25 30

Pro Val Asn Pro Glu Lys Leu Leu Val Ile Thr Val Ala Thr Ala Glu
35 40 45

Thr Glu Gly Tyr Leu Arg Phe Leu Arg Ser Ala Glu Phe Phe Asn Tyr
50 55 60

Thr Val Arg Thr Leu Gly Leu Gly Glu Glu Trp Arg Gly Gly Asp Val
65 70 75 80

Ala Arg Thr Val Gly Gly Gly Gln Lys Val Arg Trp Leu Lys Lys Glu
85 90 95

Met Glu Lys Tyr Ala Asp Arg Glu Asp Met Ile Ile Met Phe Val Asp
100 105 110

Ser Tyr Asp Val Ile Leu Ala Gly Ser Pro Thr Glu Leu Leu Lys Lys
115 120 125

Phe Val Gln Ser Gly Ser Arg Leu Leu Phe Ser Ala Glu Ser Phe Cys
130 135 140

Trp Pro Glu Trp Gly Leu Ala Glu Gln Tyr Pro Glu Val Gly Thr Gly
145 150 155 160

Lys Arg Phe Leu Asn Ser Gly Gly Phe Ile Gly Phe Ala Thr Thr Ile
165 170 175



His Gln Ile Val Arg Gln Trp Lys Tyr Lys Asp Asp Asp Asp Asp Gln
180 185 190

Leu Phe Tyr Thr Arg Leu Tyr Leu Asp Pro Gly Leu Arg Glu Lys Leu
195 200 205

Ser Leu Asn Leu Asp His Lys Ser Arg Ile Phe Gln Asn Leu Asn Gly
210 215 220

Ala Leu Asp Glu Val Val Leu Lys Phe Asp Arg Asn Arg Val Arg Ile
225 230 235 240

Arg Asn Val Ala Tyr Asp Thr Leu Pro Ile Val Val His Gly Asn Gly
245 250 255

Pro Thr Lys Leu Gln Leu Asn Tyr Leu Gly Asn Tyr Val Pro Asn Gly
260 265 270

Trp Thr Pro Glu Gly Gly Cys Gly Phe Cys Asn Gln Asp Arg Arg Thr
275 280 285

Leu Pro Gly Gly Gln Pro Pro Pro Arg Val Phe Leu Ala Val Phe Val
290 295 300

Glu Gln Pro Thr Pro Phe Leu Pro Arg Phe Leu Gln Arg Leu Leu Leu
305 310 315 320

Leu Asp Tyr Pro Pro Asp Arg Val Thr Leu Phe Leu His Asn Asn Glu
325 330 335

Val Phe His Glu Pro His Ile Ala Asp Ser Trp Pro Gln Leu Gln Asp
340 345 350

His Phe Ser Ala Val Lys Leu Val Gly Pro Glu Glu Ala Leu Ser Pro
355 360 365

Gly Glu Ala Arg Asp Met Ala Met Asp Leu Cys Arg Gln Asp Pro Glu
370 375 380

Cys Glu Phe Tyr Phe Ser Leu Asp Ala Asp Ala Val Leu Thr Asn Leu
385 390 395 400

Gln Thr Leu Arg Ile Leu Ile Glu Glu Asn Arg Lys Val Ile Ala Pro
405 410 415



Met Leu Ser Arg His Gly Lys Leu Trp Ser Asn Phe Trp Gly Ala Leu
420 425 430

Ser Pro Asp Glu Tyr Tyr Ala Arg Ser Glu Asp Tyr Val Glu Leu Val
435 440 445

Gln Arg Lys Arg Val Gly Val Trp Asn Val Pro Tyr Ile Ser Gln Ala
450 455 460

Tyr Val Ile Arg Gly Asp Thr Leu Arg Met Glu Leu Pro Gln Arg Asp
465 470 475 480

Val Phe Ser Gly Ser Asp Thr Asp Pro Asp Met Ala Phe Cys Lys Ser
485 490 495

Phe Arg Asp Lys Gly Ile Phe Leu His Leu Ser Asn Gln His Glu Phe
500 505 510

Gly Arg Leu Leu Ala Thr Ser Arg Tyr Asp Thr Glu His Leu His Pro
515 520 525

Asp Leu Trp Gln Ile Phe Asp Asn Pro Val Asp Trp Lys Glu Gln Tyr
530 535 540

Ile His Glu Asn Tyr Ser Arg Ala Leu Glu Gly Glu Gly Ile Val Glu
545 550 555 560

Gln Pro Cys Pro Asp Val Tyr Trp Phe Pro Leu Leu Ser Glu Gln Met
565 570 575

Cys Asp Glu Leu Val Ala Glu Met Glu His Tyr Gly Gln Trp Ser Gly
580 585 590

Gly Arg His Glu Val Arg Ala Trp Ala Arg Arg Ala Glu Ala Phe Pro
595 600 605

Glu Glu Glu Val Gln Asp Leu Val Asn Lys Gln Trp Asn Gln Glu Asn
610 615 620

Gly Ser Lys Glu Gly Phe Pro Gly Gly Arg Gln Thr Leu Gly Ser Gln
625 630 635 640

Leu His Ser Phe Met Tyr Ser Arg Asn Ile Tyr Cys Trp Ala Gln Trp
645 650 655



Leu Met Pro Val Ile Leu Ala Leu Trp Glu Ala Glu Ala Gly Gly Leu
660 665 670

Leu Glu Gln Gly Glu Phe Glu Ala Ser Leu
675 680


<210> 144

<211> 654

<212> PRT

<213> Homo sapien

<400> 144



Met Thr Ser Ser Gly Pro Gly Pro Arg Phe Leu Leu Leu Leu Pro Leu 
1 5 10 15 



Leu Leu Pro Pro Ala Ala Ser Ala Ser Asp Arg Pro Arg Gly Arg Asp 
20 25 30 



Pro Val Asn Pro Glu Lys Leu Leu Val Ile Thr Val Ala Thr Ala Glu
35 40 45

Thr Glu Gly Tyr Leu Arg Phe Leu Arg Ser Ala Glu Phe Phe Asn Tyr
50 55 60

Thr Val Arg Thr Leu Gly Leu Gly Glu Glu Trp Arg Gly Gly Asp Val
65 70 75 80

Ala Arg Thr Val Gly Gly Gly Gln Lys Val Arg Trp Leu Lys Lys Glu
85 90 95

Met Glu Lys Tyr Ala Asp Arg Glu Asp Met Ile Ile Met Phe Val Asp
100 105 110

Ser Tyr Asp Val Ile Leu Ala Gly Ser Pro Thr Glu Leu Leu Lys Lys
115 120 125

Phe Val Gln Ser Gly Ser Arg Leu Leu Phe Ser Ala Glu Ser Phe Cys
130 135 140

Trp Pro Glu Trp Gly Leu Ala Glu Gln Tyr Pro Glu Val Gly Thr Gly
145 150 155 160

Lys Arg Phe Leu Asn Ser Gly Gly Phe Ile Gly Phe Ala Thr Thr Ile
165 170 175

His Gln Ile Val Arg Gln Trp Lys Tyr Lys Asp Asp Asp Asp Asp Gln
180 185 190



Leu Phe Tyr Thr Arg Leu Tyr Leu Asp Pro Gly Leu Arg Glu Lys Leu
195 200 205

Ser Leu Asn Leu Asp His Lys Ser Arg Ile Phe Gln Asn Leu Asn Gly
210 215 220

Ala Leu Asp Glu Val Val Leu Lys Phe Asp Arg Asn Arg Val Arg Ile
225 230 235 240

Arg Asn Val Ala Tyr Asp Thr Leu Pro Ile Val Val His Gly Asn Gly
245 250 255

Pro Thr Lys Leu Gln Leu Asn Tyr Leu Gly Asn Tyr Val Pro Asn Gly
260 265 270

Trp Thr Pro Glu Gly Gly Cys Gly Phe Cys Asn Gln Asp Arg Arg Thr
275 280 285

Leu Pro Gly Gly Gln Pro Pro Pro Arg Val Phe Leu Ala Val Phe Val
290 295 300

Glu Gln Pro Thr Pro Phe Leu Pro Arg Phe Leu Gln Arg Leu Leu Leu
305 310 315 320

Leu Asp Tyr Pro Pro Asp Arg Val Thr Leu Phe Leu His Asn Asn Glu
325 330 335

Val Phe His Glu Pro His Ile Ala Asp Ser Trp Pro Gln Leu Gln Asp
340 345 350

His Phe Ser Ala Val Lys Leu Val Gly Pro Glu Glu Ala Leu Ser Pro
355 360 365

Gly Glu Ala Arg Asp Met Ala Met Asp Leu Cys Arg Gln Asp Pro Glu
370 375 380

Cys Glu Phe Tyr Phe Ser Leu Asp Ala Asp Ala Val Leu Thr Asn Leu
385 390 395 400

Gln Thr Leu Arg Ile Leu Ile Glu Glu Asn Arg Lys Val Ile Ala Pro
405 410 415

Met Leu Ser Arg His Gly Lys Leu Trp Ser Asn Phe Trp Gly Ala Leu
420 425 430



Ser Pro Asp Glu Tyr Tyr Ala Arg Ser Glu Asp Tyr Val Glu Leu Val
435 440 445

Gln Arg Lys Arg Val Gly Val Trp Asn Val Pro Tyr Ile Ser Gln Ala
450 455 460

Tyr Val Ile Arg Gly Asp Thr Leu Arg Met Glu Leu Pro Gln Arg Asp
465 470 475 480

Val Phe Ser Gly Ser Asp Thr Asp Pro Asp Met Ala Phe Cys Lys Ser
485 490 495

Phe Arg Asp Lys Gly Ile Phe Leu His Leu Ser Asn Gln His Glu Phe
500 505 510

Gly Arg Leu Leu Ala Thr Ser Arg Tyr Asp Thr Glu His Leu His Pro
515 520 525

Asp Leu Trp Gln Ile Phe Asp Asn Pro Val Asp Trp Lys Glu Gln Tyr
530 535 540

Ile His Glu Asn Tyr Ser Arg Ala Leu Glu Gly Glu Gly Ile Val Glu
545 550 555 560

Gln Pro Cys Pro Asp Val Tyr Trp Phe Pro Leu Leu Ser Glu Gln Met
565 570 575

Cys Asp Glu Leu Val Ala Glu Met Glu His Tyr Gly Gln Trp Ser Gly
580 585 590

Gly Arg His Glu Trp Glu Gly Thr Asn Gly Ala Gly Pro His Met Lys
595 600 605

Gly Gly Pro Gly Gln Val Ile Gly Ser Leu Ala Asp Asn Leu Ala Ala
610 615 620

Thr Gly Gly Gly Lys Gly Pro Ala Lys Ser Gly Pro Asn His Gln His
625 630 635 640



Arg Ala Arg Gly Gln Thr Gly Thr Pro Lys Gly Asp Thr Gly
645 650

<210> 145

<211> 669 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE 

<222> (613)..(613)

<223> X=any amino acid 



<400> 145 



Arg Arg Ser Gln Asp Pro Gly Cys Cys Leu Gly Pro Leu Pro Thr Met
1 5 10 15

Thr Ser Ser Gly Pro Gly Pro Arg Phe Leu Leu Leu Leu Pro Leu Leu
20 25 30

Leu Pro Pro Ala Ala Ser Ala Ser Asp Arg Pro Arg Gly Arg Asp Pro
35 40 45

Val Asn Pro Glu Lys Leu Leu Val Ile Thr Val Ala Thr Ala Glu Thr
50 55 60

Glu Gly Tyr Leu Arg Phe Leu Arg Ser Ala Glu Phe Phe Asn Tyr Thr
65 70 75 80

Val Arg Thr Leu Gly Leu Gly Glu Glu Trp Arg Gly Gly Asp Val Ala
85 90 95

Arg Thr Val Gly Gly Gly Gln Lys Val Arg Trp Leu Lys Lys Glu Met
100 105 110

Glu Lys Tyr Ala Asp Arg Glu Asp Met Ile Ile Met Phe Val Asp Ser
115 120 125

Tyr Asp Val Ile Leu Ala Gly Ser Pro Thr Glu Leu Leu Lys Lys Phe
130 135 140

Val Gln Ser Gly Ser Arg Leu Leu Phe Ser Ala Glu Ser Phe Cys Trp
145 150 155 160

Pro Glu Trp Gly Leu Ala Glu Gln Tyr Pro Glu Val Gly Thr Gly Lys
165 170 175

Arg Phe Leu Asn Ser Gly Gly Phe Ile Gly Phe Ala Thr Thr Ile His
180 185 190

Gln Ile Val Arg Gln Trp Lys Tyr Lys Asp Asp Asp Asp Asp Gln Leu
195 200 205



Phe Tyr Thr Arg Leu Tyr Leu Asp Pro Gly Leu Arg Glu Lys Leu Ser
210 215 220

Leu Asn Leu Asp His Lys Ser Arg Ile Phe Gln Asn Leu Asn Gly Ala
225 230 235 240

Leu Asp Glu Val Val Leu Lys Phe Asp Arg Asn Arg Val Arg Ile Arg
245 250 255

Asn Val Ala Tyr Asp Thr Leu Pro Ile Val Val His Gly Asn Gly Pro
260 265 270

Thr Lys Leu Gln Leu Asn Tyr Leu Gly Asn Tyr Val Pro Asn Gly Trp
275 280 285

Thr Pro Glu Gly Gly Cys Gly Phe Cys Asn Gln Asp Arg Arg Thr Leu
290 295 300

Pro Gly Gly Gln Pro Pro Pro Arg Val Phe Leu Ala Val Phe Val Glu
305 310 315 320

Gln Pro Thr Pro Phe Leu Pro Arg Phe Leu Gln Arg Leu Leu Leu Leu
325 330 335

Asp Tyr Pro Pro Asp Arg Val Thr Leu Phe Leu His Asn Asn Glu Val
340 345 350

Phe His Glu Pro His Ile Ala Asp Ser Trp Pro Gln Leu Gln Asp His
355 360 365

Phe Ser Ala Val Lys Leu Val Gly Pro Glu Glu Ala Leu Ser Pro Gly
370 375 380

Glu Ala Arg Asp Met Ala Met Asp Leu Cys Arg Gln Asp Pro Glu Cys
385 390 395 400

Glu Phe Tyr Phe Ser Leu Asp Ala Asp Ala Val Leu Thr Asn Leu Gln
405 410 415

Thr Leu Arg Ile Leu Ile Glu Glu Asn Arg Lys Val Ile Ala Pro Met
420 425 430

Leu Ser Arg His Gly Lys Leu Trp Ser Asn Phe Trp Gly Ala Leu Ser
435 440 445

Pro Asp Glu Tyr Tyr Ala Arg Ser Glu Asp Tyr Val Glu Leu Val Gln
450 455 460



Arg Lys Arg Val Gly Val Trp Asn Val Pro Tyr Ile Ser Gln Ala Tyr
465 470 475 480

Val Ile Arg Gly Asp Thr Leu Arg Met Glu Leu Pro Gln Arg Asp Val
485 490 495

Phe Ser Gly Ser Asp Thr Asp Pro Asp Met Ala Phe Cys Lys Ser Phe
500 505 510

Arg Asp Lys Gly Ile Phe Leu His Leu Ser Asn Gln His Glu Phe Gly
515 520 525

Arg Leu Leu Ala Thr Ser Arg Tyr Asp Thr Glu His Leu His Pro Asp
530 535 540

Leu Trp Gln Ile Phe Asp Asn Pro Val Asp Trp Lys Glu Gln Tyr Ile
545 550 555 560

His Glu Asn Tyr Ser Arg Ala Leu Glu Gly Glu Gly Ile Val Glu Gln
565 570 575

Pro Cys Pro Asp Val Tyr Trp Phe Pro Leu Leu Ser Glu Gln Met Cys
580 585 590

Asp Glu Leu Val Ala Glu Met Glu His Tyr Gly Gln Trp Ser Gly Gly
595 600 605

Arg His Glu Trp Xaa Gly Thr Asn Gly Ala Gly Pro His Met Lys Gly
610 615 620

Gly Pro Gly Gln Val Ile Gly Ser Leu Ala Asp Asn Leu Ala Ala Thr
625 630 635 640

Gly Gly Gly Lys Gly Pro Ala Lys Ser Gly Pro Asn His Gln His Arg
645 650 655

Ala Arg Gly Gln Thr Gly Thr Pro Lys Gly Asp Thr Gly
660 665



<210> 146

<211> 411

<212> PRT

<213> Homo sapien 

<400> 146 

Met Gln Gly Pro Tyr Val Gln Gly Leu Gly Tyr Pro Glu Gly Arg Arg
1 5 10 15

Thr Ser Arg Asp Glu Leu Gly Arg Ile Trp Gly Ala Val Leu Pro Ala
20 25 30

Asp Arg Gly Val Gly Ala Ser Arg Gly Gly Val Trp Asn Val Pro Tyr
35 40 45

Ile Ser Gln Ala Tyr Val Ile Arg Gly Asp Thr Leu Arg Met Glu Leu
50 55 60

Pro Gln Arg Asp Val Phe Ser Gly Ser Asp Thr Asp Pro Asp Met Ala
65 70 75 80

Phe Cys Lys Ser Phe Arg Asp Lys Val Ser Ala Gly Ala Arg Ser Gly
85 90 95

Leu Gly Ala Val Pro Gln Thr Pro Gly Ile Ala Cys Ile Thr Asp Thr
100 105 110

Pro Thr Pro Leu Thr Gly Ile Phe Leu His Leu Ser Asn Gln His Glu
115 120 125

Phe Gly Arg Leu Leu Ala Thr Ser Arg Tyr Asp Thr Glu His Leu His
130 135 140

Pro Asp Leu Trp Gln Ile Phe Asp Asn Pro Val Val Ser Gly Asp Pro
145 150 155 160

Ser Pro Glu His Thr Gly Ser Pro Pro Cys Ala Lys Glu Asp Thr Lys
165 170 175

Arg Asp Gly Cys Cys Gln Arg Arg Thr Leu Ser Gly Arg Gly Arg Ser
180 185 190

Pro Gly Gly Trp Glu Arg Gly Gly His Arg Ser Asp Val Pro Pro Thr
195 200 205

Leu Ser Gln Asp Trp Lys Glu Gln Tyr Ile His Glu Asn Tyr Ser Arg
210 215 220

Ala Leu Glu Gly Glu Gly Ile Val Glu Gln Pro Cys Pro Asp Val Tyr
225 230 235 240



Trp Phe Pro Leu Leu Ser Glu Gln Met Cys Asp Glu Leu Val Ala Glu
245 250 255

Met Glu His Tyr Gly Gln Trp Ser Gly Gly Arg His Glu Asp Ser Arg
260 265 270

Leu Ala Gly Gly Tyr Glu Asn Val Pro Thr Val Asp Ile His Met Lys
275 280 285

Gln Val Gly Tyr Glu Asp Gln Trp Leu Gln Leu Leu Arg Thr Tyr Val
290 295 300

Gly Pro Met Thr Glu Ser Leu Phe Pro Gly Tyr His Thr Lys Ala Arg
305 310 315 320

Ala Val Met Asn Phe Val Val Arg Tyr Arg Pro Asp Glu Gln Pro Ser
325 330 335

Leu Arg Pro His His Asp Ser Ser Thr Phe Thr Leu Asn Val Ala Leu
340 345 350

Asn His Lys Gly Leu Asp Tyr Glu Gly Gly Gly Cys Arg Phe Leu Arg
355 360 365

Tyr Asp Cys Val Ile Ser Ser Pro Arg Lys Gly Trp Ala Leu Leu His
370 375 380

Pro Gly Arg Leu Thr His Tyr His Glu Gly Leu Pro Thr Thr Trp Gly
385 390 395 400

Thr Arg Tyr Ile Met Val Ser Phe Val Asp Pro
405 410



<210> 147 

<211> 255 

<212> PRT 

<213> Homo sapien 

<400> 147 

Val Gly Thr Pro Arg Pro Glu His Thr Gly Ser Pro Pro Cys Ala Lys
1 5 10 15

Glu Asp Thr Lys Arg Asp Gly Cys Cys Gln Arg Arg Thr Leu Ser Gly
20 25 30

Arg Gly Arg Ser Pro Gly Gly Trp Glu Arg Gly Gly His Arg Ser Asp
35 40 45



Val Pro Pro Thr Leu Ser Gln Asp Trp Lys Glu Gln Tyr Ile His Glu
50 55 60

Asn Tyr Ser Arg Ala Leu Glu Gly Glu Gly Ile Val Glu Gln Pro Cys
65 70 75 80

Pro Asp Val Tyr Trp Phe Pro Leu Leu Ser Glu Gln Met Cys Asp Glu
85 90 95

Leu Val Ala Glu Met Glu His Tyr Gly Gln Trp Ser Gly Gly Arg His
100 105 110

Glu Asp Ser Arg Leu Ala Gly Gly Tyr Glu Asn Val Pro Thr Val Asp
115 120 125

Ile His Met Lys Gln Val Gly Tyr Glu Asp Gln Trp Leu Gln Leu Leu
130 135 140

Arg Thr Tyr Val Gly Pro Met Thr Glu Ser Leu Phe Pro Gly Tyr His
145 150 155 160

Thr Lys Ala Arg Ala Val Met Asn Phe Val Val Arg Tyr Arg Pro Asp
165 170 175

Glu Gln Pro Ser Leu Arg Pro His His Asp Ser Ser Thr Phe Thr Leu
180 185 190

Asn Val Ala Leu Asn His Lys Gly Leu Asp Tyr Glu Gly Gly Gly Cys
195 200 205

Arg Phe Leu Arg Tyr Asp Cys Val Ile Ser Ser Pro Arg Lys Gly Trp
210 215 220

Ala Leu Leu His Pro Gly Arg Leu Thr His Tyr His Glu Gly Leu Pro
225 230 235 240

Thr Thr Trp Gly Thr Arg Tyr Ile Met Val Ser Phe Val Asp Pro
245 250 255



<210> 148

<211> 312

<212> PRT

<213> Homo sapien

<400> 148

Met Arg Ser Pro Glu Leu Ala Leu Pro Arg Gly Met Gln Pro Thr Glu
1 5 10 15

Phe Phe Gln Ser Leu Gly Gly Asp Gly Glu Arg Asn Val Gln Ile Glu
20 25 30

Met Ala His Gly Thr Thr Thr Leu Ala Phe Lys Phe Gln His Gly Val
35 40 45

Ile Ala Ala Val Asp Ser Arg Ala Ser Ala Gly Ser Tyr Ile Ser Ala
50 55 60

Leu Arg Val Asn Lys Val Ile Glu Ile Asn Pro Tyr Leu Leu Gly Thr
65 70 75 80

Met Ser Gly Cys Ala Ala Asp Cys Gln Tyr Trp Glu Arg Leu Leu Ala
85 90 95

Lys Glu Cys Arg Leu Tyr Tyr Leu Arg Asn Gly Glu Arg Ile Ser Val
100 105 110

Ser Ala Ala Ser Lys Leu Leu Ser Asn Met Met Cys Gln Tyr Arg Gly
115 120 125

Met Gly Leu Ser Met Gly Ser Met Ile Cys Gly Trp Asp Lys Lys Gly
130 135 140

Pro Gly Leu Tyr Tyr Val Asp Glu His Gly Thr Arg Leu Ser Gly Asn
145 150 155 160

Met Phe Ser Thr Gly Ser Gly Asn Thr Tyr Ala Tyr Gly Val Met Asp
165 170 175

Ser Gly Tyr Arg Pro Asn Leu Ser Pro Glu Glu Ala Tyr Asp Leu Gly
180 185 190

Arg Arg Ala Ile Ala Tyr Ala Thr His Arg Asp Ser Tyr Ser Gly Gly
195 200 205

Val Val Asn Met Ser Thr Val Ser Ser Trp Ser Gly Pro Leu Glu Arg
210 215 220

Pro Met Ser Cys Pro Val Cys Pro Ser Gly Ser Ala Ser Ile Ala Ala
225 230 235 240



Gly Ala Gly Gly Arg Pro Gly Arg Thr Cys Gly Pro Pro Gly His Thr
245 250 255

Leu Ser Pro Gly Lys Lys Gly Val Val Cys Gln Gln Lys Val Val Ser
260 265 270

Thr Ser Gly Lys His Met Gly Arg Tyr Lys Ser Arg Val Cys Ala Ala
275 280 285

Lys Met Ile Ile Ile Gly Gly Gly Ser Thr Pro Tyr Pro Tyr Asn Gly
290 295 300

Ser Arg Val His Thr Ser Tyr Tyr
305 310




<210> 149

<211> 299

<212> PRT

<213> Homo sapien

<400> 149



Leu Ser Phe Ser Arg Lys Arg Gln Gly Asp Val Glu Lys Ser Leu Val
1 5 10 15

Pro Ser Pro Ser Ile Cys Gly Phe Arg Phe His Phe Leu Leu Arg Glu
20 25 30

Arg Thr Asp Leu Trp Val Leu Gly Gly His Gly Ala Thr Arg Cys Met
35 40 45

Arg Ser Pro Arg Gly Gln Arg Pro Glu Ser Ala Leu Pro Val Ala Gly
50 55 60

Ser Gly Arg Arg Ser Asp Pro Gly His Tyr Ser Phe Ser Met Arg Ser
65 70 75 80

Pro Glu Leu Ala Leu Pro Arg Gly Met Gln Pro Thr Glu Phe Phe Gln
85 90 95

Ser Leu Gly Gly Asp Gly Glu Arg Asn Val Gln Ile Glu Met Ala His
100 105 110

Gly Thr Thr Thr Leu Ala Phe Lys Phe Gln His Gly Val Ile Ala Ala
115 120 125






Val Asp Ser Arg Ala Ser Ala Gly Ser Tyr Ile Ser Ala Leu Arg Val
130 135 140

Asn Lys Val Ile Glu Ile Asn Pro Tyr Leu Leu Gly Thr Met Ser Gly
145 150 155 160

Cys Ala Ala Asp Cys Gln Tyr Trp Glu Arg Leu Leu Ala Lys Glu Cys
165 170 175

Arg Leu Tyr Tyr Leu Arg Asn Gly Glu Arg Ile Ser Val Ser Ala Ala
180 185 190

Ser Lys Leu Leu Ser Asn Met Met Cys Gln Tyr Arg Gly Met Gly Leu
195 200 205

Ser Met Gly Ser Met Ile Cys Gly Trp Asp Lys Lys Gly Pro Gly Leu
210 215 220

Tyr Tyr Val Asp Glu His Gly Thr Arg Leu Ser Gly Asn Met Phe Ser
225 230 235 240

Thr Gly Ser Gly Asn Thr Tyr Ala Tyr Gly Val Met Asp Ser Gly Tyr
245 250 255

Arg Pro Asn Leu Ser Pro Glu Glu Ala Tyr Asp Leu Gly Arg Arg Ala
260 265 270

Ile Ala Tyr Ala Thr His Arg Asp Ser Tyr Ser Gly Gly Val Val Asn
275 280 285

Met Ser Thr Val Ser Ser Trp Ser Arg Ala Thr
290 295



<210> 150 

<211> 274 

<212> PRT 

<213> Homo sapien 

<400> 150 

Met Arg Ser Pro Glu Leu Ala Leu Pro Arg Gly Met Gln Pro Thr Glu
1 5 10 15

Phe Phe Gln Ser Leu Gly Gly Asp Gly Glu Arg Asn Val Gln Ile Glu
20 25 30

Met Ala His Gly Thr Thr Thr Leu Ala Phe Lys Phe Gln His Gly Val
35 40 45



Ile Ala Ala Val Asp Ser Arg Ala Ser Ala Gly Ser Tyr Ile Ser Ala
50 55 60

Leu Arg Val Asn Lys Val Ile Glu Ile Asn Pro Tyr Leu Leu Gly Thr
65 70 75 80

Met Ser Gly Cys Ala Ala Asp Cys Gln Tyr Trp Glu Arg Leu Leu Ala
85 90 95

Lys Glu Cys Arg Leu Tyr Tyr Leu Arg Asn Gly Glu Arg Ile Ser Val
100 105 110

Ser Ala Ala Ser Lys Leu Leu Ser Asn Met Met Cys Gln Tyr Arg Gly
115 120 125

Met Gly Leu Ser Met Gly Ser Met Ile Cys Gly Trp Asp Lys Lys Gly
130 135 140

Pro Gly Leu Tyr Tyr Val Asp Glu His Gly Thr Arg Leu Ser Gly Asn
145 150 155 160

Met Phe Ser Thr Gly Ser Gly Asn Thr Tyr Ala Tyr Gly Val Met Asp
165 170 175

Ser Gly Tyr Arg Pro Asn Leu Ser Pro Glu Glu Ala Tyr Asp Leu Gly
180 185 190

Arg Arg Ala Ile Ala Tyr Ala Thr His Arg Asp Ser Tyr Ser Gly Gly
195 200 205

Val Val Asn Ile Pro Glu Asp Gly Gly Asp Ser Glu Arg Pro Phe Gly
210 215 220

Arg Ala Ser Arg Asp His Gln Gly Gly Gly Gly Arg Pro Arg Gly Gly
225 230 235 240

Ser Gly Thr Pro Phe Arg Lys Ser Arg Lys Val Ser Arg Gly Gly Cys
245 250 255

Leu Asn Gln Trp Pro Lys Lys Gly Pro Ala Ala Tyr Trp Arg Ser Ser
260 265 270



Thr Tyr

<210> 151

<211> 294 

<212> PRT 

<213> Homo sapien 

<400> 151 

Leu Ser Phe Ser Arg Lys Arg Gln Gly Asp Val Glu Lys Ser Leu Val
1 5 10 15

Pro Ser Pro Ser Ile Cys Gly Phe Arg Phe His Phe Leu Leu Arg Glu
20 25 30

Arg Thr Asp Leu Trp Val Leu Gly Gly His Gly Ala Thr Arg Cys Met
35 40 45

Arg Ser Pro Arg Gly Gln Arg Pro Glu Ser Ala Leu Pro Val Ala Gly
50 55 60

Ser Gly Arg Arg Ser Asp Pro Gly His Tyr Ser Phe Ser Met Arg Ser
65 70 75 80

Pro Glu Leu Ala Leu Pro Arg Gly Met Gln Pro Thr Glu Phe Phe Gln
85 90 95

Ser Leu Gly Gly Asp Gly Glu Arg Asn Val Gln Ile Glu Met Ala His
100 105 110

Gly Thr Thr Thr Leu Ala Phe Lys Phe Gln His Gly Val Ile Ala Ala
115 120 125

Val Asp Ser Arg Ala Ser Ala Gly Ser Tyr Ile Ser Ala Leu Arg Val
130 135 140

Asn Lys Val Ile Glu Ile Asn Pro Tyr Leu Leu Gly Thr Met Ser Gly
145 150 155 160

Cys Ala Ala Asp Cys Gln Tyr Trp Glu Arg Leu Leu Ala Lys Glu Cys
165 170 175

Arg Leu Tyr Tyr Leu Arg Asn Gly Glu Arg Ile Ser Val Ser Ala Ala
180 185 190

Ser Lys Leu Leu Ser Asn Met Met Cys Gln Tyr Arg Gly Met Gly Leu
195 200 205

Ser Met Gly Ser Met Ile Cys Gly Trp Asp Lys Lys Gly Pro Gly Leu
210 215 220



Tyr Tyr Val Asp Glu His Gly Thr Arg Leu Ser Gly Asn Met Phe Ser 
225 230 235 240 



Thr Gly Ser Gly Asn Thr Tyr Ala Tyr Gly Val Met Asp Ser Gly Tyr 
245 250 255 



Arg Pro Asn Leu Ser Pro Glu Glu Ala Tyr Asp Leu Gly Arg Arg Ala 
260 265 270 



Ile Ala Tyr Ala Thr His Arg Asp Ser Tyr Ser Gly Gly Val Val Asn
275 280 285

Ile Pro Glu Glu Trp Trp
290



<210> 152 

<211> 237 

<212> PRT 

<213> Homo sapien 

<400> 152 

Met Arg Ser Pro Glu Leu Ala Leu Pro Arg Gly Met Gln Pro Thr Glu
1 5 10 15

Phe Phe Gln Ser Leu Gly Gly Asp Gly Glu Arg Asn Val Gln Ile Glu
20 25 30

Met Ala His Gly Thr Thr Thr Leu Ala Phe Lys Phe Gln His Gly Val
35 40 45

Ile Ala Ala Val Asp Ser Arg Ala Ser Ala Gly Ser Tyr Ile Ser Ala
50 55 60

Leu Arg Val Asn Lys Val Ile Glu Ile Asn Pro Tyr Leu Leu Gly Thr
65 70 75 80

Met Ser Gly Cys Ala Ala Asp Cys Gln Tyr Trp Glu Arg Leu Leu Ala
85 90 95

Lys Glu Cys Arg Leu Tyr Tyr Leu Arg Asn Gly Glu Arg Ile Ser Val
100 105 110

Ser Ala Ala Ser Lys Leu Leu Ser Asn Met Met Cys Gln Tyr Arg Gly
115 120 125

Met Gly Leu Ser Met Gly Ser Met Ile Cys Gly Trp Asp Lys Lys Gly
130 135 140



Pro Gly Leu Tyr Tyr Val Asp Glu His Gly Thr Arg Leu Ser Gly Asn 
145 150 155 160 



Met Phe Ser Thr Gly Ser Gly Asn Thr Tyr Ala Tyr Gly Val Met Asp 
165 170 175 



Ser Gly Tyr Arg Pro Asn Leu Ser Pro Glu Glu Ala Tyr Asp Leu Gly 
180 185 190 



Arg Arg Ala Ile Ala Tyr Ala Thr His Arg Asp Ser Tyr Ser Gly Gly
195 200 205

Val Val Asn Met Tyr His Ile Asp Thr Thr Met Val Gly Glu Ser Lys
210 215 220

Val Asp Val Asp Leu Cys Pro Pro Glu Ala Gln Ser Gly
225 230 235



<210> 153

<211> 297

<212> PRT

<213> Homo sapien

<220>

<221> MISC_FEATURE

<222> (297)..(297)

<223> X=any amino acid

<400> 153



Leu Ser Phe Ser Arg Lys Arg Gln Gly Asp Val Glu Lys Ser Leu Val
1 5 10 15

Pro Ser Pro Ser Ile Cys Gly Phe Arg Phe His Phe Leu Leu Arg Glu
20 25 30

Arg Thr Asp Leu Trp Val Leu Gly Gly His Gly Ala Thr Arg Cys Met
35 40 45

Arg Ser Pro Arg Gly Gln Arg Pro Glu Ser Ala Leu Pro Val Ala Gly
50 55 60

Ser Gly Arg Arg Ser Asp Pro Gly His Tyr Ser Phe Ser Met Arg Ser
65 70 75 80

Pro Glu Leu Ala Leu Pro Arg Gly Met Gln Pro Thr Glu Phe Phe Gln
85 90 95



Ser Leu Gly Gly Asp Gly Glu Arg Asn Val Gln Ile Glu Met Ala His
100 105 110

Gly Thr Thr Thr Leu Ala Phe Lys Phe Gln His Gly Val Ile Ala Ala
115 120 125

Val Asp Ser Arg Ala Ser Ala Gly Ser Tyr Ile Ser Ala Leu Arg Val
130 135 140

Asn Lys Val Ile Glu Ile Asn Pro Tyr Leu Leu Gly Thr Met Ser Gly
145 150 155 160

Cys Ala Ala Asp Cys Gln Tyr Trp Glu Arg Leu Leu Ala Lys Glu Cys
165 170 175

Arg Leu Tyr Tyr Leu Arg Asn Gly Glu Arg Ile Ser Val Ser Ala Ala
180 185 190

Ser Lys Leu Leu Ser Asn Met Met Cys Gln Tyr Arg Gly Met Gly Leu
195 200 205

Ser Met Gly Ser Met Ile Cys Gly Trp Asp Lys Lys Gly Pro Gly Leu
210 215 220

Tyr Tyr Val Asp Glu His Gly Thr Arg Leu Ser Gly Asn Met Phe Ser
225 230 235 240

Thr Gly Ser Gly Asn Thr Tyr Ala Tyr Gly Val Met Asp Ser Gly Tyr
245 250 255

Arg Pro Asn Leu Ser Pro Glu Glu Ala Tyr Asp Leu Gly Arg Arg Ala
260 265 270

Ile Ala Tyr Ala Thr His Arg Asp Ser Tyr Ser Gly Gly Val Val Asn
275 280 285

Met Tyr His Ile Glu Arg Asp Gly Xaa
290 295



<210> 154

<211> 260

<212> PRT

<213> Homo sapien

<400> 154

Met Asp Ser Ser Leu Pro Asn Ala Gln His Asp Val Gln Gly Leu Ser
1 5 10 15

Pro Gln Glu Asn Leu Ser Pro Pro Ser Phe Pro Thr Ser Ser His Pro
20 25 30

Gln Val Cys Gln Lys Ser Gly Glu Ile Ser Leu Leu Lys Gln Gln Leu
35 40 45

Lys Glu Ser Gln Ala Glu Leu Val Gln Lys Gly Ser Glu Leu Val Ala
50 55 60

Leu Arg Val Ala Leu Arg Glu Ala Arg Ala Thr Leu Arg Val Ser Glu
65 70 75 80

Gly Arg Ala Arg Gly Leu Gln Glu Ala Ala Arg Ala Arg Glu Leu Glu
85 90 95

Leu Glu Ala Cys Ser Gln Glu Leu Gln Arg His Arg Gln Glu Ala Glu
100 105 110

Gln Leu Arg Glu Lys Ala Gly Gln Leu Asp Ala Glu Ala Ala Gly Leu
115 120 125

Arg Glu Pro Pro Val Pro Pro Ala Thr Ala Asp Pro Phe Leu Leu Ala
130 135 140

Glu Ser Asp Glu Ala Lys Val Gln Arg Ala Ala Ala Gly Val Gly Gly
145 150 155 160

Ser Leu Arg Ala Gln Val Glu Arg Leu Arg Val Glu Leu Gln Arg Glu
165 170 175

Arg Arg Arg Gly Glu Glu Gln Arg Asp Ser Phe Glu Gly Glu Arg Leu
180 185 190

Ala Trp Gln Ala Glu Lys Glu Gln Val Ile Arg Tyr Gln Lys Gln Leu
195 200 205

Gln His Asn Tyr Ile Gln Met Tyr Arg Arg Asn Arg Gln Leu Glu Gln
210 215 220

Glu Leu Gln Gln Leu Ser Leu Glu Leu Glu Ala Arg Glu Leu Ala Asp
225 230 235 240



Leu Gly Leu Ala Glu Gln Ala Pro Cys Ile Cys Leu Glu Glu Ile Thr
245 250 255

Ala Thr Glu Ile
260



<210> 155 

<211> 249 

<212> PRT 

<213> Homo sapien 

<400> 155 

Cys Ala Gly Pro Val Pro Pro Gly Glu Pro Val Ser Pro Ser Phe Pro
1 5 10 15

Thr Ser Ser His Pro Gln Val Cys Gln Lys Ser Gly Glu Ile Ser Leu
20 25 30

Leu Lys Gln Gln Leu Lys Glu Ser Gln Ala Glu Leu Val Gln Lys Gly
35 40 45

Ser Glu Leu Val Ala Leu Arg Val Ala Leu Arg Glu Ala Arg Ala Thr
50 55 60

Leu Arg Val Ser Glu Gly Arg Ala Arg Gly Leu Gln Glu Ala Ala Arg
65 70 75 80

Ala Arg Glu Leu Glu Leu Glu Ala Cys Ser Gln Glu Leu Gln Arg His
85 90 95

Arg Gln Glu Ala Glu Gln Leu Arg Glu Lys Ala Gly Gln Leu Asp Ala
100 105 110

Glu Ala Ala Gly Leu Arg Glu Pro Pro Val Pro Pro Ala Thr Ala Asp
115 120 125

Pro Phe Leu Leu Ala Glu Ser Asp Glu Ala Lys Val Gln Arg Ala Ala
130 135 140

Ala Gly Val Gly Gly Ser Leu Arg Ala Gln Val Glu Arg Leu Arg Val
145 150 155 160

Glu Leu Gln Arg Glu Arg Arg Arg Gly Glu Glu Gln Arg Asp Ser Phe
165 170 175

Glu Gly Glu Arg Leu Ala Trp Gln Ala Glu Lys Glu Gln Val Ile Arg
180 185 190



Tyr Gln Lys Gln Leu Gln His Asn Tyr Ile Gln Met Tyr Arg Arg Asn
195 200 205

Arg Gln Leu Glu Gln Glu Leu Gln Gln Leu Ser Leu Glu Leu Glu Ala
210 215 220

Arg Glu Leu Ala Asp Leu Gly Leu Ala Glu Gln Ala Pro Cys Ile Cys
225 230 235 240

Leu Glu Glu Ile Thr Ala Thr Glu Ile
245



<210> 156 

<211> 132 

<212> PRT 

<213> Homo sapien 

<400> 156 

Met Arg Gly Ser His Leu Pro Gln Pro Gln Thr His Phe Ser Lys Trp
1 5 10 15



Ser Leu His Leu Pro Leu Leu Leu Phe Arg Ala Pro Arg Ala Gly Ala 
20 25 30 



Gly Leu Ala Gly Pro Gly Phe Pro Asp Ser Trp Leu Trp Leu Pro Gly 
35 40 45 



Cys Val Ala Leu Arg Ala Arg Leu Cys Gly Thr Leu Asp Leu Trp Ala 
50 55 60 



Leu Gly Ser Val Gly Leu Gly Gly Pro Gly Ser Leu Gly Pro Ala Gly 
65 70 75 80 



Cys Pro Leu Val Gly Leu Thr Leu Gly Pro Leu Arg Val Phe Val Ser 
85 90 95 



Phe Gly Ser Leu Ser Phe Ser Glu Pro Val Ser Phe Leu Gln Phe Ser
100 105 110 



Leu Ser Pro Cys Ser Leu Lys Trp Gly Asn Ala Thr Glu Val Pro Ala 
115 120 125 



Pro Leu Ala Pro
130



<210> 157 

<211> 225 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE

<222> (23)..(23)

<223> X=any amino acid 



<400> 157 

Ala Trp Ser Ser Ala Ala Glu Val Arg Gly Ala Gly Ser Gly Arg Arg 
1 5 10 15 



Pro Gly Glu Pro Pro Thr Xaa Ala Arg Trp Asn Pro Ala Gly Gly Thr 
20 25 30 



Ala Phe Pro Ser Arg Pro His Phe Leu Glu Gly Gly Leu Asp Ser Gly 
35 40 45 



Glu Trp Val Arg Gly Ser Trp Leu Pro Ala Ala Glu Trp Glu Leu Val 
50 55 60 



Ile Ala Asp Leu Thr Pro Leu Asp Trp Ser Leu Glu Phe Gly Trp Glu
65 70 75 80

Gly Asp Phe Pro Gln Leu Phe Arg Ser Val Ser Gly Pro Glu Pro Ala
85 90 95

Pro Ala Leu Cys Ser Pro Gln Pro Thr Lys Arg Gly Val Leu Leu Gly
100 105 110

Gln Ser Asp Cys Ser Gly Ser Ala Gly Glu Asn Glu Gly Phe Pro Ser
115 120 125

Pro Thr Ala Thr Asn Ala Phe Leu Lys Met Glu Pro Pro Leu Ala Pro
130 135 140

Pro Ser Leu Ser Gly Ser Lys Ser Gly Ser Arg Pro Gly Trp Ala Trp
145 150 155 160

Leu Ser Arg Leu Leu Ala Leu Ala Pro Trp Met Cys Gly Ser Gln Gly
165 170 175



Pro Ala Leu Trp His Phe Gly Leu Met Gly Pro Arg Leu Arg Gly Val
180 185 190

Gly Gly Pro Trp Leu Thr Gly Ser Arg Trp Leu Ser Pro Cys Arg Pro 
195 200 205 

Asp Pro Arg Ala Ser Pro Ser Leu Cys Leu Phe Trp Val Pro Val Phe 
210 215 220 



Leu
225




<210> 158

<211> 451

<212> PRT

<213> Homo sapien

<400> 158



Ala Ser Ala Thr Met Ala Ile Val Gln Thr Leu Pro Val Pro Leu Glu
1 5 10 15

Pro Ala Pro Glu Ala Ala Thr Ala Pro Gln Ala Pro Val Met Gly Ser
20 25 30

Val Ser Ser Leu Ile Ser Gly Arg Pro Cys Pro Gly Gly Pro Ala Pro
35 40 45

Pro Arg His His Gly Pro Pro Gly Pro Thr Phe Phe Arg Gln Gln Asp
50 55 60

Gly Leu Leu Arg Gly Gly Tyr Glu Ala Gln Glu Pro Leu Cys Pro Ala
65 70 75 80

Val Pro Pro Arg Lys Ala Val Pro Val Thr Ser Phe Thr Tyr Ile Asn
85 90 95

Glu Asp Phe Arg Thr Glu Ser Pro Pro Ser Pro Ser Ser Asp Val Glu
100 105 110

Asp Ala Arg Glu Gln Arg Ala His Asn Ala His Leu Arg Gly Pro Pro
115 120 125

Pro Lys Leu Ile Pro Val Ser Gly Lys Leu Glu Lys Asn Met Glu Lys
130 135 140

Ile Leu Ile Arg Pro Thr Ala Phe Lys Pro Val Leu Pro Lys Pro Arg
145 150 155 160

Gly Ala Pro Ser Leu Pro Ser Phe Met Gly Pro Arg Ala Thr Gly Leu
165 170 175



Ser Gly Ser Gin Gly Ser Leu Thr Gin Leu Phe Gly Gly Pro Ala Ser 
180 185 190 



Ser Ser Ser Ser Ser Ser Ser Ser Ser Ala Ala Asp Lys Pro Leu Ala 
195 200 205 



Phe Ser Gly Trp Ala Ser Gly Cys Pro Ser Gly Thr Leu Ser Asp Ser 
210 ~ 215 220 



Gly Arg Asn Ser Leu Ser Ser Leu Pro Thr Tyr Ser Thr Gly Gly Ala 
225 " 230 235 240 



Glu Pro Thr Thr Ser Ser Pro Gly Gly His Leu Pro Ser His Gly Ser 
245 250 255 



Gly Arg Gly Ala Leu Pro Gly Pro Ala Arg Gly Val Pro Thr Gly Pro 
260 265 270 



Ser His Ser Asp Ser Gly Arg Ser Ser Ser Ser Lys Ser Thr Gly Ser 
275 280 285 

Leu Gly Gly Arg Val Ala Gly Gly Leu Leu Gly Ser Gly Thr Arg Ala 
290 295 300 



Ser Pro Asp Ser Ser Ser Cys Gly Glu Arg Ser Pro Pro Pro Pro Pro 
305 310 315 320 



Pro Pro Pro Ser Asp Glu Ala Leu Leu His Cys Val Leu Glu Gly Lys 
325 330 335 



Leu Arg Asp Arg Glu Ala Glu Leu Gln Gln Leu Arg Asp Ser Leu Asp
340 345 350 



Glu Asn Glu Ala Thr Met Cys Gln Ala Tyr Glu Glu Arg Gln Arg His
355 360 365 



Trp Gln Arg Glu Arg Glu Ala Leu Arg Glu Asp Cys Ala Ala Gln Ala
370 375 380 



Gln Arg Ala Gln Arg Ala Gln Gln Leu Leu Gln Leu Gln Val Phe Gln
385 390 395 400
Leu Gln Gln Glu Lys Arg Gln Leu Gln Asp Asp Phe Ala Gln Leu Leu
405 410 415 



Gln Glu Arg Glu Gln Leu Glu Arg Arg Cys Ala Thr Leu Glu Arg Glu
420 425 430 



Arg Arg Arg Gly Glu Glu Gln Arg Asp Ser Phe Glu Gly Glu Arg Asp
435 440 445 



His Cys Tyr 
450 



<210> 159 

<211> 1193 

<212> PRT 

<213> Homo sapien 

<400> 159 

Met Glu Arg His Gln Pro Arg Leu His His Pro Ala Gln Gly Ser Ala
1 5 10 15



Ala Gly Thr Pro Tyr Pro Ser Ser Ala Ser Leu Arg Gly Cys Arg Glu 
20 25 30 



Ser Lys Met Pro Arg Arg Lys Gly Pro Gln His Pro Pro Pro Pro Ser
35 40 45 



Gly Pro Glu Glu Pro Gly Glu Lys Arg Pro Lys Phe His Leu Asn Ile
50 55 60 



Arg Thr Leu Thr Asp Asp Met Leu Asp Lys Phe Ala Ser Ile Arg Ile
65 70 75 80 



Pro Gly Ser Lys Lys Glu Arg Pro Pro Leu Pro Asn Leu Lys Thr Ala 
85 90 95 



Phe Ala Ser Ser Asp Cys Ser Ala Ala Pro Leu Glu Met Met Glu Asn 
100 105 110 



Phe Pro Lys Pro Leu Ser Glu Asn Glu Leu Leu Glu Leu Phe Glu Lys 
115 120 125 



Met Met Glu Asp Met Asn Leu Asn Glu Asp Lys Lys Ala Pro Leu Arg 
130 135 140



Glu Lys Asp Phe Ser Ile Lys Lys Glu Met Val Met Gln Tyr Ile Asn
145 150 155 160 
Thr Ala Ser Lys Thr Gly Ser Leu Lys Arg Ser Arg Gln Ile Ser Pro
165 170 175 



Gln Glu Phe Ile His Glu Leu Lys Met Gly Ser Ala Asp Glu Arg Leu
180 185 190 



Val Thr Cys Leu Glu Ser Leu Arg Val Ser Leu Thr Ser Asn Pro Val 
195 200 205 



Ser Trp Val Glu Ser Phe Gly His Glu Gly Leu Gly Leu Leu Leu Asp 
210 215 220 



Ile Leu Glu Lys Leu Ile Ser Gly Lys Ile Gln Glu Lys Val Val Lys
225 230 235 240 



Lys Asn Gln His Lys Val Ile Gln Cys Leu Lys Ala Leu Met Asn Thr
245 250 255 



Gln Tyr Gly Leu Glu Arg Ile Met Ser Glu Glu Arg Ser Leu Ser Leu
260 265 270 



Leu Ala Lys Ala Val Asp Pro Arg His Pro Asn Met Met Thr Asp Val 
275 280 285 



Val Lys Leu Leu Ser Ala Val Cys Ile Val Gly Glu Glu Ser Ile Leu
290 295 300 



Glu Glu Val Leu Glu Ala Leu Thr Ser Ala Gly Glu Glu Lys Lys Ile
305 310 315 320 



Asp Arg Phe Phe Cys Ile Val Glu Gly Leu Arg His Asn Ser Val Gln
325 330 335 



Leu Gln Val Ala Cys Met Gln Leu Ile Asn Ala Leu Val Thr Ser Pro
340 345 350 



Asp Asp Leu Asp Phe Arg Leu His Ile Arg Asn Glu Phe Met Arg Cys
355 360 365



Gly Leu Lys Glu Ile Leu Pro Asn Leu Lys Cys Ile Lys Asn Asp Gly
370 375 380 



Leu Asp Ile Gln Leu Lys Val Phe Asp Glu His Lys Glu Glu Asp Leu
385 390 395 400 
Phe Glu Leu Ser His Arg Leu Glu Asp Ile Arg Ala Glu Leu Asp Glu
405 410 415 



Ala Tyr Asp Val Tyr Asn Met Val Trp Ser Thr Val Lys Glu Thr Arg 
420 425 430 



Ala Glu Gly Tyr Phe Ile Ser Ile Leu Gln His Leu Leu Leu Ile Arg
435 440 445 



Asn Asp Tyr Phe Ile Arg Gln Gln Tyr Phe Lys Leu Ile Asp Glu Cys
450 455 460 



Val Ser Gln Ile Val Leu His Arg Asp Gly Met Asp Pro Asp Phe Thr
465 470 475 480 



Tyr Arg Lys Arg Leu Asp Leu Asp Leu Thr Gln Phe Val Asp Ile Cys
485 490 495 



Ile Asp Gln Ala Lys Leu Glu Glu Phe Glu Glu Lys Ala Ser Glu Leu
500 505 510 



Tyr Lys Lys Phe Glu Lys Glu Phe Thr Asp His Gln Glu Thr Gln Ala
515 520 525 



Glu Leu Gln Lys Lys Glu Ala Lys Ile Asn Glu Leu Gln Ala Glu Leu
530 535 540 



Gln Ala Phe Lys Ser Gln Phe Gly Ala Leu Pro Ala Asp Cys Asn Ile
545 550 555 560 



Pro Leu Pro Pro Ser Lys Glu Gly Gly Thr Gly His Ser Ala Leu Pro 
565 570 575



Pro Pro Pro Pro Leu Pro Ser Gly Gly Gly Val Pro Pro Pro Pro Pro 
580 585 590 



Pro Pro Pro Pro Pro Pro Leu Pro Gly Met Arg Met Pro Phe Ser Gly 
595 600 605 



Pro Val Pro Pro Pro Pro Pro Leu Gly Phe Leu Gly Gly Gln Asn Ser
610 615 620 



Pro Pro Leu Pro Ile Leu Pro Phe Gly Leu Lys Pro Lys Lys Glu Phe
625 630 635 640 
Lys Pro Glu Ile Ser Met Arg Arg Leu Asn Trp Leu Lys Ile Arg Pro
645 650 655 



His Glu Met Thr Glu Asn Cys Phe Trp Ile Lys Val Asn Glu Asn Lys
660 665 670 



Tyr Glu Asn Val Asp Leu Leu Cys Lys Leu Glu Asn Thr Phe Cys Cys 
675 680 685 



Gln Gln Lys Glu Arg Arg Glu Glu Glu Asp Ile Glu Glu Lys Lys Ser
690 695 700 



Ile Lys Lys Lys Ile Lys Glu Leu Lys Phe Leu Asp Ser Lys Ile Ala
705 710 715 720 



Gln Asn Leu Ser Ile Phe Leu Ser Ser Phe Arg Val Pro Tyr Glu Glu
725 730 735 



Ile Arg Met Met Ile Leu Glu Val Asp Glu Thr Arg Leu Ala Glu Ser
740 745 750 



Met Ile Gln Asn Leu Ile Lys His Leu Pro Asp Gln Glu Gln Leu Asn
755 760 765 



Ser Leu Ser Gln Phe Lys Ser Glu Tyr Ser Asn Leu Cys Glu Pro Glu
770 775 780 



Gln Phe Val Val Val Met Ser Asn Val Lys Arg Leu Arg Pro Arg Leu
785 790 795 800 



Ser Ala Ile Leu Phe Lys Leu Gln Phe Glu Glu Gln Val Asn Asn Ile
805 810 815 



Lys Pro Asp Ile Met Ala Val Ser Thr Ala Cys Glu Glu Ile Lys Lys
820 825 830 



Ser Lys Ser Phe Ser Lys Leu Leu Glu Leu Val Leu Leu Met Gly Asn 
835 840 845 



Tyr Met Asn Ala Gly Ser Arg Asn Ala Gln Thr Phe Gly Phe Asn Leu
850 855 860 



Ser Ser Leu Cys Lys Leu Lys Asp Thr Lys Ser Ala Asp Gln Lys Thr
865 870 875 880 



Thr Leu Leu His Phe Leu Val Glu Ile Cys Glu Glu Lys Tyr Pro Asp
885 890 895



Ile Leu Asn Phe Val Asp Asp Leu Glu Pro Leu Asp Lys Ala Ser Lys
900 905 910 



Val Ser Val Glu Thr Leu Glu Lys Asn Leu Arg Gln Met Gly Arg Gln
915 920 925 



Leu Gln Gln Leu Glu Lys Glu Leu Glu Thr Phe Pro Pro Pro Glu Asp
930 935 940 



Leu His Asp Lys Phe Val Thr Lys Met Ser Arg Phe Val Ile Ser Ala
945 950 955 960 



Lys Glu Gln Tyr Glu Thr Leu Ser Lys Leu His Glu Asn Met Glu Lys
965 970 975 



Leu Tyr Gln Ser Ile Ile Gly Tyr Tyr Ala Ile Asp Val Lys Lys Val
980 985 990 



Ser Val Glu Asp Phe Leu Thr Asp Leu Asn Asn Phe Arg Thr Thr Phe 
995 1000 1005 



Met Gln Ala Ile Lys Glu Asn Ile Lys Lys Arg Glu Ala Glu Glu
1010 1015 1020 



Lys Glu Lys Arg Val Arg Ile Ala Lys Glu Leu Ala Glu Arg Glu
1025 1030 1035 



Arg Leu Glu Arg Gln Gln Lys Lys Lys Arg Leu Leu Glu Met Lys
1040 1045 1050 



Thr Glu Gly Asp Glu Thr Gly Val Met Asp Asn Leu Leu Glu Ala 
1055 1060 1065 



Leu Gln Ser Gly Ala Ala Phe Arg Asp Arg Arg Lys Arg Thr Pro
1070 1075 1080 



Met Pro Lys Asp Val Arg Gln Ser Leu Ser Pro Met Ser Gln Arg
1085 1090 1095 



Pro Val Leu Lys Val Cys Asn His Glu Asn Gln Lys Val Gln Leu
1100 1105 1110 



Thr Glu Gly Ser Arg Ser His Tyr Asn Ile Asn Cys Asn Ser Thr
1115 1120 1125

Arg Thr Pro Val Ala Lys Glu Leu Asn Tyr Asn Leu Asp Thr His
1130 1135 1140 



Thr Ser Thr Gly Arg Ile Lys Ala Ala Glu Lys Lys Glu Ala Cys
1145 1150 1155 



Asn Val Glu Ser Asn Arg Lys Lys Glu Thr Glu Leu Leu Gly Ser 
1160 1165 1170 



Phe Ser Lys Asn Glu Ser Val Pro Glu Val Glu Ala Leu Leu Ala 
1175 1180 1185 



Arg Leu Arg Ala Leu 
1190 



<210> 160 

<211> 650 

<212> PRT 

<213> Homo sapien 

<400> 160 

Met Ser Leu Ala Asp Glu Leu Leu Ala Asp Leu Glu Glu Ala Ala Glu 
1 5 10 15



Glu Glu Glu Gly Gly Ser Tyr Gly Glu Glu Glu Glu Glu Pro Ala Ile
20 25 30 



Glu Asp Val Gln Glu Glu Thr Gln Leu Asp Leu Ser Gly Asp Ser Val
35 40 45 



Lys Thr Ile Ala Lys Leu Trp Asp Ser Lys Met Phe Ala Glu Ile Met
50 55 60 



Met Lys Ile Glu Glu Tyr Ile Ser Lys Gln Ala Lys Ala Ser Glu Val
65 70 75 80 



Met Gly Pro Val Glu Ala Ala Pro Glu Tyr Arg Val Ile Val Asp Ala
85 90 95



Asn Asn Leu Thr Val Glu Ile Glu Asn Glu Leu Asn Ile Ile His Lys
100 105 110 



Phe Ile Arg Asp Lys Tyr Ser Lys Arg Phe Pro Glu Leu Glu Ser Leu
115 120 125

Val Pro Asn Ala Leu Asp Tyr Ile Arg Thr Val Lys Glu Leu Gly Asn
130 135 140 



Ser Leu Asp Lys Cys Lys Asn Asn Glu Asn Leu Gln Gln Ile Leu Thr
145 150 155 160 



Asn Ala Thr Ile Met Val Val Ser Val Thr Ala Ser Thr Thr Gln Gly
165 170 175 



Gln Gln Leu Ser Glu Glu Glu Leu Glu Arg Leu Glu Glu Ala Cys Asp
180 185 190 



Met Ala Leu Glu Leu Asn Ala Ser Lys His Arg Ile Tyr Glu Tyr Val
195 200 205 



Glu Ser Arg Met Ser Phe Ile Ala Pro Asn Leu Ser Ile Ile Ile Gly
210 215 220 



Ala Ser Thr Ala Ala Lys Ile Met Gly Val Ala Gly Gly Leu Thr Asn
225 230 235 240 



Leu Ser Lys Met Pro Ala Cys Asn Ile Met Leu Leu Gly Ala Gln Arg
245 250 255 



Lys Thr Leu Ser Gly Phe Ser Ser Thr Ser Val Leu Pro His Thr Gly 
260 265 270 



Tyr Ile Tyr His Ser Asp Ile Val Gln Ser Leu Pro Pro Asp Leu Arg
275 280 285 



Arg Lys Ala Ala Arg Leu Val Ala Ala Lys Cys Thr Leu Ala Ala Arg 
290 295 300 



Val Asp Ser Phe His Glu Ser Thr Glu Gly Lys Val Gly Tyr Glu Leu 
305 310 315 320 



Lys Asp Glu Ile Glu Arg Lys Phe Asp Lys Trp Gln Glu Pro Pro Pro
325 330 335 



Val Lys Gln Val Lys Pro Leu Pro Ala Pro Leu Asp Gly Gln Arg Lys
340 345 350 



Lys Arg Gly Gly Arg Ser Val Arg Gly Pro Gly Gly Pro Val Gly Met 
355 360 365 



Gly Val Met Glu Gly Arg Ser Arg Arg Pro Pro Pro Ser Arg Leu Pro 
370 375 380



Gly Ala Ala His Pro Pro Val Pro Val Pro Gln Asp Glu Gly Ala Ala
385 390 395 400 



Gly Ala Asp Gly Asp Pro Glu Ala Gly Gln Pro Tyr Glu Leu Arg Arg
405 410 415 



Gly Gln Thr Pro Arg Ala Pro Ser Ser Thr Pro Gln Pro Ala Ser Arg
420 425 430 



His Arg Pro Leu Pro Pro Ala Thr Ala Pro Pro Leu Val Leu Trp Pro 
435 440 445 



Trp Leu Met Ser Arg Ala Leu Pro Gln Pro Pro Pro Pro Arg Pro Leu
450 455 460 



Phe Ser Phe Pro Ser Ile Gln Pro Gln Ser Asp Pro Arg Gly Pro Trp
465 470 475 480 



Ser Leu Cys Leu Arg Cys Leu Glu Pro Pro Arg Leu Pro Ile Ala Pro
485 490 495 



Gly Ser Leu Ala Gly Ser Ser Leu Pro Arg Gly Ser Leu Val Pro Cys 
500 505 510 



Cys Thr Ala Ala Pro Ser Leu Gly Pro Ala Ser Leu Leu Cys Tyr Pro 
515 520 525 



Ser Val Ile Pro Leu Val Leu Gln Asp Arg Thr Gln Arg Pro Pro His
530 535 540 



Pro Ile Lys Pro Val Leu Val Pro Asp Ile Pro Arg Pro Thr Arg Ile
545 550 555 560



Glu Glu Asp Ala Tyr Gln Glu Asp Leu Gly Phe Ser Leu Gly His Leu
565 570 575 



Gly Lys Ser Gly Ser Gly Arg Val Arg Gln Thr Gln Val Asn Glu Ala
580 585 590 



Thr Lys Ala Arg Ile Ser Lys Thr Leu Gln Arg Thr Leu Gln Lys Gln
595 600 605 



Ser Val Val Tyr Gly Gly Lys Ser Thr Ile Arg Asp Arg Ser Ser Gly
610 615 620 
Thr Ala Ser Ser Val Ala Phe Thr Pro Leu Gln Gly Pro Gly Asp Cys
625 630 635 640 



Glu Pro Thr Gly Gly Arg Glu Glu Gly Gly 
645 650 



<210> 161 

<211> 369 

<212> PRT 

<213> Homo sapien 

<400> 161 

Lys Gln Trp Cys Ala Glu Arg Arg Gly Leu Gly Met Ser Leu Ala Asp
1 5 10 15



Glu Leu Leu Ala Asp Leu Glu Glu Ala Ala Glu Glu Glu Glu Gly Gly 
20 25 30 



Ser Tyr Gly Glu Glu Glu Glu Glu Pro Ala Ile Glu Asp Val Gln Glu
35 40 45 



Glu Thr Gln Leu Asp Leu Ser Gly Asp Ser Val Lys Thr Ile Ala Lys
50 55 60 



Leu Trp Asp Ser Lys Met Phe Ala Glu Ile Met Met Lys Ile Glu Glu
65 70 75 80 



Tyr Ile Ser Lys Gln Ala Lys Ala Ser Glu Val Met Gly Pro Val Glu
85 90 95 



Ala Ala Pro Glu Tyr Arg Val Ile Val Asp Ala Asn Asn Leu Thr Val
100 105 110 



Glu Ile Glu Asn Glu Leu Asn Ile Ile His Lys Phe Ile Arg Asp Lys
115 120 125 



Tyr Ser Lys Arg Phe Pro Glu Leu Glu Ser Leu Val Pro Asn Ala Leu 
130 135 140 



Asp Tyr Ile Arg Thr Val Lys Glu Leu Gly Asn Ser Leu Asp Lys Cys
145 150 155 160 



Lys Asn Asn Glu Asn Leu Gln Gln Ile Leu Thr Asn Ala Thr Ile Met
165 170 175 
Val Val Ser Val Thr Ala Ser Thr Thr Gln Gly Gln Gln Leu Ser Glu
180 185 190 



Glu Glu Leu Glu Arg Leu Glu Glu Ala Cys Asp Met Ala Leu Glu Leu 
195 200 205 



Asn Ala Ser Lys His Arg Ile Tyr Glu Tyr Val Glu Ser Arg Met Ser
210 215 220 



Phe Ile Ala Pro Asn Leu Ser Ile Ile Ile Gly Ala Ser Thr Ala Ala
225 230 235 240 



Lys Ile Met Gly Val Ala Gly Gly Leu Thr Asn Leu Ser Lys Met Pro
245 250 255 



Ala Cys Asn Ile Met Leu Leu Gly Ala Gln Arg Lys Thr Leu Ser Gly
260 265 270 



Phe Ser Ser Thr Ser Val Leu Pro His Thr Gly Tyr Ile Tyr His Ser
275 280 285 



Asp Ile Val Gln Ser Leu Pro Pro Asp Leu Arg Arg Lys Ala Ala Arg
290 295 300 



Leu Val Ala Ala Lys Cys Thr Leu Ala Ala Arg Val Asp Ser Phe His 
305 310 315 320 



Glu Ser Thr Glu Gly Lys Val Gly Tyr Glu Leu Lys Asp Glu Ile Glu
325 330 335 



Arg Lys Phe Asp Lys Trp Gln Glu Pro Pro Pro Val Lys Gln Val Lys
340 345 350 



Pro Leu Pro Ala Pro Leu Asp Gly Gln Arg Lys Lys Arg Gly Gly Arg
355 360 365 



Arg 



<210> 162 

<211> 164 

<212> PRT 

<213> Homo sapien 

<400> 162 

Arg Arg Pro Tyr Ala Gly Thr Arg Leu Pro Val Gly Ser Pro Gly Leu 
1 5 10 15

Phe Leu Asn Ala Ala Ala Ala Thr Pro Arg Thr Pro Ser Val Thr Gly
20 25 30 



Ala Thr Glu Thr Val Thr Pro Ser Glu Ala Pro Val Leu Ala Ala Glu 
35 40 45 



Pro Glu Ala Asp Lys Gly Thr Val Leu Ala Leu Thr Glu Asn Asn Phe 
50 55 60 



Asp Asp Thr Ile Ala Glu Gly Ile Thr Phe Ile Lys Phe Tyr Ala Pro
65 70 75 80 



Trp Cys Gly His Cys Lys Thr Leu Ala Pro Thr Trp Glu Glu Leu Ser 
85 90 95 



Lys Lys Glu Phe Pro Gly Leu Ala Gly Val Lys Ile Ala Glu Val Asp
100 105 110 



Cys Thr Ala Glu Arg Asn Ile Cys Ser Lys Tyr Ser Val Arg Gly Tyr
115 120 125 



Pro Thr Leu Leu Leu Phe Arg Gly Gly Lys Lys Val Ser Glu His Ser 
130 135 140 



Gly Gly Arg Asp Leu Asp Ser Leu His Arg Phe Val Leu Ser Gln Ala
145 150 155 160 



Lys Asp Glu Leu 



<210> 163 

<211> 31 

<212> PRT 

<213> Homo sapien 

<400> 163 

Thr Asp Gln His Ser Ile Glu Ser Thr Val Thr Leu Arg Ser Glu Glu
1 5 10 15 



Arg Gln Tyr Cys His Lys Ala Thr Phe Pro Glu Leu Val Gly Arg
20 25 30 



<210> 164 

<211> 110 

<212> PRT 

<213> Homo sapien 

<400> 164

Pro Leu Cys Ile Gln His Ser Ser Pro Thr Ser His Thr Gln Gly Asp
1 5 10 15



Val Asp Thr Trp Pro Lys Val Thr Gly Gly Arg Asn Leu Arg Asn Lys 
20 25 30 



Thr Thr Tyr Thr Val Cys Leu Arg Gln Lys Ile Thr Ala Ala Ser Arg
35 40 45 



Pro Ala Ser Ala Leu Lys Glu Ile Phe Ile Asn His Val Trp Phe Thr
50 55 60 



Asp Asn Ser Phe Phe Lys Lys Thr Gln Pro Pro Arg Glu Ala Gln Leu
65 70 75 80



Ser Arg Val Leu Tyr Thr Gln Leu Gln Leu Cys Ile Thr Ser Leu Val
85 90 95 



Phe Gln Glu Asn Gln Ser Gly Thr Ile Cys Leu Phe Thr Leu
100 105 110



<210> 165 

<211> 39 

<212> PRT 

<213> Homo sapien 

<400> 165 

Met Val Thr Leu Glu Phe Pro Tyr Glu Arg Arg Asp Val Lys Thr Lys 
1 5 10 15



Asp Arg Cys Gln Trp Ala Ala Leu Ala Leu Val Cys Thr Ala Val Ala
20 25 30 



Ala Val Asp Ala Ser Val Leu 
35 



<210> 166 

<211> 110 

<212> PRT 

<213> Homo sapien 

<400> 166 

Pro Leu Cys Ile Gln His Ser Ser Pro Thr Ser His Thr Gln Gly Asp
1 5 10 15 

Val Asp Thr Trp Pro Lys Val Thr Gly Gly Arg Asn Leu Arg Asn Lys
20 25 30 



Thr Thr Tyr Thr Val Cys Leu Arg Gln Lys Ile Thr Ala Ala Ser Arg
35 40 45 



Pro Ala Ser Ala Leu Lys Glu Ile Phe Ile Asn His Val Trp Phe Thr
50 55 60 



Asp Asn Ser Phe Phe Lys Lys Thr Gln Pro Pro Arg Glu Ala Gln Leu
65 70 75 80 



Ser Arg Val Leu Tyr Thr Gln Leu Gln Leu Cys Ile Thr Ser Leu Val
85 90 95 



Phe Gln Glu Asn Gln Ser Gly Thr Ile Cys Leu Phe Thr Leu
100 105 110 



<210> 167 

<211> 140 

<212> PRT 

<213> Homo sapien 

<400> 167 

Met Val Gly Arg Arg Ala Leu Ile Val Leu Ala His Ser Glu Arg Thr
1 5 10 15



Ser Phe Asn Tyr Ala Met Lys Glu Ala Ala Ala Ala Ala Leu Lys Lys 
20 25 30 



Lys Gly Trp Glu Val Val Glu Ser Asp Leu Tyr Ala Met Asn Phe Asn 
35 40 45 



Pro Ile Ile Ser Arg Lys Asp Ile Thr Gly Lys Leu Lys Asp Pro Ala
50 55 60 



Asn Phe Gln Tyr Pro Ala Glu Ser Val Leu Ala Tyr Lys Glu Gly His
65 70 75 80 



Leu Ser Pro Asp Ile Val Ala Glu Gln Lys Lys Leu Glu Ala Ala Asp
85 90 95 



Leu Val Ile Phe Gln Val Trp Gly Asp Ile Gly Arg Gly Val Arg Asp
100 105 110



Ile Cys Val Leu Ile Val Leu Asp Met Tyr Leu Ile Ser Tyr Glu Ile
115 120 125 



Leu Asn Ser Tyr Pro Leu Tyr Cys Ile Leu Cys Lys
130 135 140 



<210> 168 

<211> 73 

<212> PRT 

<213> Homo sapien 

<400> 168 

Phe Glu Ser Val Pro Cys Lys Gly Glu Tyr Glu Thr Cys Asn Ser Asp 
1 5 10 15 



Asn Leu Asn His Ala Val Leu Ala Val Gly Tyr Gly Ile Gln Lys Gly
20 25 30 



Asn Lys His Trp Ile Ile Lys Asn Ser Trp Gly Glu Asn Trp Gly Asn
35 40 45 



Lys Gly Tyr Ile Leu Met Ala Arg Asn Lys Asn Asn Ala Cys Gly Ile
50 55 60 



Ala Asn Leu Ala Ser Phe Pro Lys Met 
65 70 



<210> 169

<211> 72 

<212> PRT 

<213> Homo sapien 

<400> 169 

Ser Ser Arg Tyr Pro Gln Gly Glu Tyr Glu Thr Cys Asn Ser Asp Asn
1 5 10 15



Leu Asn His Ala Val Leu Ala Val Gly Tyr Gly Ile Gln Lys Gly Asn
20 25 30 



Lys His Trp Ile Ile Lys Asn Ser Trp Gly Glu Asn Trp Gly Asn Lys
35 40 45 



Gly Tyr Ile Leu Met Ala Arg Asn Lys Asn Asn Ala Cys Gly Ile Ala
50 55 60 



Asn Leu Ala Ser Phe Pro Lys Met 
65 70 



<210> 170 

<211> 321

<212> PRT

<213> Homo sapien

<400> 170 

Phe Val Ser Leu Cys Ser Gly Ser Ser Ser Cys Arg Ser Leu Leu Phe 
1 5 10 15



Phe Phe Arg Phe Val Leu Ile Arg Trp Ser Phe Pro Leu Leu Ser Ser
20 25 30 



Ser Phe Ser Ser Ser Leu Phe Val Val Leu Phe Arg Arg Cys Gly Leu 
35 40 45 



Val Arg Phe Ser Arg Ser Val Leu Ala Ser Val Leu Leu Ala Leu Leu 
50 55 60 



Leu Leu Ser Ser Cys Val Arg Phe Pro Val Ala Cys Leu Ser Phe Ser 
65 70 75 80 



Leu Leu Leu Val Ile Cys Phe Ser Leu Phe Leu Leu Phe Leu Ser Pro
85 90 95 



Val Ser Pro Ser Phe Leu Val Ser Ser Ser Pro Phe Leu Leu Phe Ala 
100 105 110 



Cys Ala Cys Leu Ala Arg Ser Val Phe Phe Cys Leu Cys Phe Cys Arg 
115 120 125 



Val Arg Leu Ser Leu Val Phe Phe Gly Leu Leu Phe Leu Phe Ser Pro 
130 135 140 



Leu Arg Ser Leu Leu Phe Ser Val Leu Arg Ala Ser Val Pro Phe Val 
145 150 155 160 



Phe Phe Val Phe Phe Ala Ser Phe Arg Ser Leu Arg Ser Ser Ser Ser 
165 170 175 



Val Pro Leu Leu Ser Ser Phe Leu Pro Leu Ser Pro Phe Leu Leu Leu 
180 185 190 



Trp Leu Pro Ser Leu Ala Val Leu Pro Leu Arg Leu Pro Leu Leu Pro 
195 200 205 



Ser Val Val Ser Arg Cys Cys Ser Cys Val Leu Leu Cys Val Leu Val 
210 215 220 

Leu Phe Trp Phe Leu Val Gly Gly Cys Val Val Cys Ala Leu Cys Val
225 230 235 240 



Leu Phe Val Val Phe Val Arg Ser Trp Cys Thr Ala Glu Lys Ser His 
245 250 255 



His Gln Arg Thr Ser Phe Asn Arg Leu Ile Val Gly Ala Ser Pro Glu
260 265 270 



Gly Leu Arg Ala Gly Arg Ser Gly Gly Cys Ser Arg Leu Leu Phe Phe 
275 280 285 



Ala Pro Trp Ala Leu Ser Lys Arg Ser Arg Tyr Leu Ala Leu Glu Gly 
290 295 300 



Thr Leu Ala Pro Pro Phe Phe Phe Cys Met Ser Thr Phe Ala Phe Ile
305 310 315 320 



Glu 



<210> 171 

<211> 320 

<212> PRT 

<213> Homo sapien 

<400> 171 

Arg Leu Ala Leu Phe Trp Phe Phe Leu Val Ser Val Val Ile Val Leu
1 5 10 15



Leu Pro Leu Arg Ser His Ser Leu Val Leu Ser Leu Ala Leu Phe Leu 
20 25 30 



Leu Phe Leu Leu Pro Leu Arg Gly Ser Leu Pro Ala Leu Trp Ser Cys 
35 40 45 



Ser Phe Leu Ser Val Cys Pro Gly Phe Arg Leu Val Gly Ser Ser Pro 
50 55 60 



Ser Val Phe Leu Arg Pro Val Ser Cys Cys Leu Ser Val Phe Leu Ala 
65 70 75 80 



Ser Ala Arg His Leu Phe Leu Leu Val Leu Ala Phe Ser Leu Ser Gly 
85 90 95 



Leu Ala Phe Phe Pro Cys Leu Leu Leu Pro Phe Ser Ser Leu Cys Val 
100 105 110



Cys Leu Ser Arg Ser Leu Cys Val Leu Leu Ser Leu Leu Leu Ser Cys 
115 120 125 



Pro Leu Val Ser Cys Leu Leu Arg Leu Ala Phe Ser Leu Leu Ser Ser 
130 135 140 



Pro Phe Ala Ala Val Phe Cys Ser Pro Cys Leu Cys Pro Phe Cys Leu 
145 150 155 160 



Leu Arg Leu Leu Arg Phe Leu Pro Phe Pro Pro Leu Leu Phe Leu Arg
165 170 175 



Ser Ser Pro Leu Val Val Pro Pro Ala Leu Pro Phe Arg Cys Cys Ser 
180 185 190 



Gly Phe Pro Arg Trp Pro Ser Ser Leu Phe Ala Ser Leu Ser Phe Pro 
195 200 205



Leu Ser Phe Leu Val Val Val Leu Val Cys Cys Cys Val Cys Ser Cys 
210 215 220 



Cys Ser Gly Ser Trp Leu Val Val Val Trp Cys Val Arg Cys Val Cys 
225 230 235 240 



Cys Leu Leu Cys Leu Cys Ala Arg Gly Val Leu Leu Arg Ser His Thr 
245 250 255 



Thr Ser Val Arg Arg Leu Thr Gly Tyr Arg Gly Arg Ile Pro Arg Gly
260 265 270 



Ser Pro Ser Arg Ala Val Arg Arg Leu Phe Pro Pro Ser Phe Phe Cys 
275 280 285 



Ser Leu Gly Pro Val Gln Thr Val Pro Tyr Leu Ala Leu Glu Gly Thr
290 295 300 



Leu Ala Pro Pro Phe Phe Phe Cys Met Ser Thr Phe Ala Phe Ile Glu
305 310 315 320

Arg Leu Ala Leu Phe Trp Phe Phe Leu Val Ser Val Val Ile Val Leu
1 5 10 15



Leu Pro Leu Arg Ser His Ser Leu Val Leu Ser Leu Ala Leu Phe Leu 
20 25 30 



Leu Phe Leu Leu Pro Leu Arg Gly Ser Leu Pro Ala Leu Trp Ser Cys 
35 40 45 



Ser Phe Leu Ser Val Cys Pro Gly Phe Arg Leu Val Gly Ser Ser Pro 
50 55 60 



Ser Val Phe Leu Arg Pro Val Ser Cys Cys Leu Ser Val Phe Leu Ala 
65 70 75 80 



Ser Ala Arg His Leu Phe Leu Leu Val Leu Ala Phe Ser Leu Ser Gly 
85 90 95 



Leu Ala Phe Phe Pro Cys Leu Leu Leu Pro Phe Ser Ser Leu Cys Val 
100 105 110



Cys Leu Ser Arg Ser Leu Cys Val Leu Leu Ser Leu Leu Leu Ser Cys 
115 120 125 



Pro Leu Val Ser Cys Leu Leu Arg Leu Ala Phe Ser Leu Leu Ser Ser 
130 135 140 



Pro Phe Ala Ala Val Phe Cys Ser Pro Cys Leu Cys Pro Phe Cys Leu 
145 150 155 160 



Leu Arg Leu Leu Arg Phe Leu Pro Phe Pro Pro Leu Leu Phe Leu Arg 
165 170 175 



Ser Ser Pro Leu Val Val Pro Pro Ala Leu Pro Phe Arg Cys Cys Ser 
180 185 190 



Gly Phe Pro Arg Trp Pro Ser Ser Leu Phe Ala Ser Leu Ser Phe Pro 
195 200 205 



Leu Ser Phe Leu Val Val Val Leu Val Cys Cys Cys Val Cys Ser Cys 
210 215 220 



Cys Ser Gly Ser Trp Leu Val Val Val Trp Cys Val Arg Cys Val Cys 
225 230 235 240 

Cys Leu Leu Cys Leu Cys Ala Arg Gly Val Leu Leu Arg Ser His Thr
245 250 255 



Thr Ser Val Arg Arg Leu Thr Gly Tyr Arg Gly Arg Ile Pro Arg Gly
260 265 270 



Ser Pro Ser Arg Ala Val Arg Arg Leu Phe Pro Pro Ser Phe Phe Cys 
275 280 285 



Ser Leu Gly Pro Val Gln Thr Val Pro Tyr Leu Ala Leu Glu Gly Thr
290 295 300 



Leu Ala Pro Pro Phe Phe Phe Cys Met Ser Thr Phe Ala Phe Ile Glu
305 310 315 320

Met Glu Glu Asn Leu Ile Ser Met Arg Glu Asp His Ser Phe His Val
1 5 10 15



Arg Tyr Arg Met Glu Ala Ser Cys Leu Glu Leu Ala Leu Glu Gly Glu 
20 25 30 



Arg Leu Cys Lys Ser Gly Asp Cys Arg Ala Gly Val Ser Phe Phe Glu 
35 40 45 



Ala Ala Val Gln Val Gly Thr Glu Asp Leu Lys Thr Leu Ser Ala Ile
50 55 60 



Tyr Ser Gln Leu Gly Asn Ala Tyr Phe Tyr Leu His Asp Tyr Ala Lys
65 70 75 80 



Ala Leu Glu Tyr His His His Asp Leu Thr Leu Ala Arg Thr Ile Gly
85 90 95 



Asp Gln Leu Gly Glu Ala Lys Ala Ser Gly Asn Leu Gly Asn Thr Leu
100 105 110 



Lys Val Leu Gly Asn Phe Asp Glu Ala Ile Val Cys Cys Gln Arg His
115 120 125 



Leu Asp Ile Ser Arg Glu Leu Asn Asp Lys Val Gly Glu Ala Arg Ala
130 135 140 

Leu Tyr Asn Leu Gly Asn Val Tyr His Ala Lys Gly Lys Ser Phe Gly
145 150 155 160 



Cys Pro Gly Pro Gln Asp Val Gly Glu Phe Pro Glu Glu Val Arg Asp
165 170 175 



Ala Leu Gln Ala Ala Val Asp Phe Tyr Glu Glu Asn Leu Ser Leu Val
180 185 190 



Thr Ala Leu Gly Asp Arg Ala Ala Gln Gly Arg Ala Phe Gly Asn Leu
195 200 205 



Gly Asn Thr His Tyr Leu Leu Gly Asn Phe Arg Asp Ala Val Ile Ala
210 215 220 



His Glu Gln Arg Leu Leu Ile Ala Lys Glu Phe Gly Asp Lys Ala Ala
225 230 235 240 



Glu Arg Arg Ala Tyr Ser Asn Leu Gly Asn Ala Tyr Ile Phe Leu Gly
245 250 255 



Glu Phe Glu Thr Ala Ser Glu Tyr Tyr Lys Lys Thr Leu Leu Leu Ala 
260 265 270 



Arg Gln Leu Lys Asp Arg Ala Val Glu Ala Gln Ser Cys Tyr Ser Leu
275 280 285 



Gly Asn Thr Tyr Thr Leu Leu Gln Asp Tyr Glu Lys Ala Ile Asp Tyr
290 295 300 



His Leu Lys His Leu Ala Ile Ala Gln Glu Leu Asn Asp Arg Ile Gly
305 310 315 320 



Glu Gly Arg Ala Cys Trp Ser Leu Gly Asn Ala Tyr Thr Ala Leu Gly 
325 330 335 



Asn His Asp Gln Ala Met His Phe Ala Glu Lys His Leu Glu Ile Ser
340 345 350 



Arg Glu Val Gly Asp Lys Ser Gly Glu Leu Thr Ala Arg Leu Asn Leu 
355 360 365 



Ser Asp Leu Gln Met Val Leu Gly Leu Ser Tyr Ser Thr Asn Asn Ser
370 375 380 

Ile Met Ser Glu Asn Thr Glu Ile Asp Ser Ser Leu Asn Gly Val Arg
385 390 395 400 



Pro Lys Leu Gly Arg Arg His Ser Met Glu Asn Met Glu Leu Met Lys 
405 410 415 



Leu Thr Pro Glu Lys Val Gln Asn Trp Asn Ser Glu Ile Leu Ala Lys
420 425 430 



Gln Lys Pro Leu Ile Ala Lys Pro Ser Ala Lys Leu Leu Phe Val Asn
435 440 445 



Arg Leu Lys Gly Lys Lys Tyr Lys Thr Asn Ser Ser Thr Lys Val Leu 
450 455 460 



Gln Asp Ala Ser Asn Ser Ile Asp His Arg Ile Pro Asn Ser Gln Arg
465 470 475 480 



Lys Ile Ser Ala Asp Thr Ile Gly Asp Glu Gly Phe Phe Asp Leu Leu
485 490 495 



Ser Arg Phe Gln Ser Asn Arg Met Asp Asp Gln Arg Cys Cys Leu Gln
500 505 510 



Glu Lys Asn Cys His Thr Ala Ser Thr Thr Thr Ser Ser Thr Pro Pro 
515 520 525 



Lys Met Met Leu Lys Thr Ser Ser Val Pro Val Val Ser Pro Asn Thr 
530 535 540 



Asp Glu Phe Leu Asp Leu Leu Ala Ser Ser Gln Ser Arg Arg Leu Asp
545 550 555 560 



Asp Gln Arg Ala Ser Phe Ser Asn Leu Pro Gly Leu Arg Leu Thr Gln
565 570 575 



Asn Ser Gln Ser Val Leu Ser His Leu Met Thr Asn Asp Asn Lys Glu
580 585 590 



Ala Asp Glu Asp Phe Phe Asp Ile Leu Val Lys Cys Gln Gly Ser Arg
595 600 605



Leu Asp Asp Gln Arg Cys Ala Pro Pro Pro Ala Thr Thr Lys Gly Pro
610 615 620 

Thr Val Pro Asp Glu Asp Phe Phe Ser Leu Ile Leu Arg Ser Gln Gly
625 630 635 640 



Lys Arg Met Asp Glu Gln Arg Val Leu Leu Gln Arg Asp Gln Asn Arg
645 650 655 



Asp Thr Asp Phe Gly Leu Lys Asp Phe Leu Gln Asn Asn Ala Leu Leu
660 665 670 



Glu Phe Lys Asn Ser Gly Lys Lys Ser Ala Asp His
675 680

Met Ser Gly His Ser Pro Thr Arg Gly Ala Met Gln Val Ala Met Asn
1 5 10 15



Gly Lys Ala Arg Lys Glu Ala Val Gln Thr Ala Ala Lys Glu Leu Leu
20 25 30 



Lys Phe Val Asn Arg Ser Pro Ser Pro Phe His Ala Val Ala Glu Cys 
35 40 45 



Arg Asn Arg Leu Leu Gln Ala Gly Phe Ser Glu Leu Lys Glu Thr Glu
50 55 60 



Lys Trp Asn Ile Lys Pro Glu Ser Lys Tyr Phe Met Thr Arg Asn Ser
65 70 75 80 



Ser Thr Ile Ile Ala Phe Ala Val Gly Gly Gln Tyr Val Pro Gly Asn
85 90 95 



Gly Phe Ser Leu Ile Gly Ala His Thr Asp Ser Pro Cys Leu Arg Val
100 105 110



Lys Arg Arg Ser Arg Arg Ser Gln Val Gly Phe Gln Gln Val Gly Val
115 120 125 



Glu Thr Tyr Gly Gly Gly Ile Trp Ser Thr Trp Phe Asp Arg Asp Leu
130 135 140



Thr Leu Ala Gly Arg Val Ile Val Lys Cys Pro Thr Ser Gly Arg Leu
145 150 155 160 

Glu Gln Gln Leu Val His Val Glu Arg Pro Ile Leu Arg Ile Pro His
165 170 175 



Leu Ala Ile His Leu Gln Arg Asn Ile Asn Glu Asn Phe Gly Pro Asn
180 185 190 



Thr Glu Met His Leu Val Pro Ile Leu Ala Thr Ala Ile Gln Glu Glu
195 200 205 



Leu Glu Lys Gly Thr Pro Glu Pro Gly Pro Leu Asn Ala Val Asp Glu 
210 215 220 



Arg His His Ser Val Leu Met Ser Leu Leu Cys Ala His Leu Gly Leu 
225 230 235 240 



Ser Pro Lys Asp Ile Val Glu Met Glu Leu Cys Leu Ala Asp Thr Gln
245 250 255 



Pro Ala Val Leu Gly Gly Ala Tyr Asp Glu Phe Ile Phe Ala Pro Arg
260 265 270 



Leu Asp Asn Leu His Ser Cys Phe Cys Ala Leu Gln Ala Leu Ile Asp
275 280 285 



Ser Cys Ala Gly Pro Gly Ser Leu Ala Thr Glu Pro His Val Arg Met 
290 295 300 



Val Thr Leu Tyr Asp Asn Glu Glu Val Gly Ser Glu Ser Ala Gln Gly
305 310 315 320 



Ala Gln Ser Leu Leu Thr Glu Leu Val Leu Arg Arg Ile Ser Ala Ser
325 330 335 



Cys Gln His Pro Thr Ala Phe Glu Glu Ala Ile Pro Lys Ser Phe Met
340 345 350 



Ile Ser Ala Asp Met Ala His Ala Val His Pro Asn Tyr Leu Asp Lys
355 360 365 



His Glu Glu Asn His Arg Pro Leu Phe His Lys Gly Pro Val Ile Lys
370 375 380 



Val Ala Lys Ala Pro Thr Cys Arg Pro Tyr Gly Leu Leu Gln Pro Phe
385 390 395 400 

Gly Asp Ala Ile Lys Thr Leu His Gln Arg Ala Leu Lys Pro Ala Asn
405 410 415 



Leu Pro Ser Thr Phe Lys Ser Thr Ala Asp Leu Ala Phe Asn Ile Val
420 425 430 



Leu Leu Leu Lys Pro Pro Ser Ile Ala Glu Pro Arg Val Asn Tyr Thr
435 440 445 



Lys Val Tyr Leu Leu Arg Pro Leu Arg Ala Gly Thr His Pro Cys Gln
450 455 460 



Gly Gly Thr Ala Ala Val Ala Gly Asp Ala Pro His Ser Ser Pro Ile
465 470 475 480 



Leu Ser Ala Thr Arg Ala Ala Gln Lys His Lys Gln Gln Gly Ala Arg
485 490 495 



Arg Arg Met 



<210> 175 

<211> 407 

<212> PRT 

<213> Homo sapien 

<400> 175 

Gly Ala Gly Gln Ala Ala Arg Trp Gly Arg Ala Glu Pro Gly Gly Gln
1 5 10 15



Met Ser Gly His Ser Pro Thr Arg Gly Ala Met Gln Val Ala Met Asn
20 25 30 



Gly Lys Ala Arg Lys Glu Ala Val Gln Thr Ala Ala Lys Glu Leu Leu
35 40 45 



Lys Phe Val Asn Arg Ser Pro Ser Pro Phe His Ala Val Ala Glu Cys 
50 55 60 



Arg Asn Arg Leu Leu Gln Ala Gly Phe Ser Glu Leu Lys Glu Thr Glu
65 70 75 80 



Lys Trp Asn Ile Lys Pro Glu Ser Lys Tyr Phe Met Thr Arg Asn Ser
85 90 95 



Ser Thr Ile Ile Ala Phe Ala Val Gly Gly Gln Tyr Val Pro Gly Asn
100 105 110



Gly Phe Ser Leu Ile Gly Ala His Thr Asp Ser Pro Cys Leu Arg Val
115 120 125 



Lys Arg Arg Ser Arg Arg Ser Gln Val Gly Phe Gln Gln Val Gly Val
130 135 140 



Glu Thr Tyr Gly Gly Gly Ile Trp Ser Thr Trp Phe Asp Arg Asp Leu
145 150 155 160 



Thr Leu Ala Gly Arg Val Ile Val Lys Cys Pro Thr Ser Gly Arg Leu
165 170 175 



Glu Gln Gln Leu Val His Val Glu Arg Pro Ile Leu Arg Ile Pro His
180 185 190 



Leu Ala Ile His Leu Gln Arg Asn Ile Asn Glu Asn Phe Gly Pro Asn
195 200 205 



Thr Glu Met His Leu Val Pro Ile Leu Ala Thr Ala Ile Gln Glu Glu
210 215 220 



Leu Glu Lys Gly Thr Pro Glu Pro Gly Pro Leu Asn Ala Val Asp Glu 
225 230 235 240 



Arg His His Ser Val Leu Met Ser Leu Leu Cys Ala His Leu Gly Leu 
245 250 255 



Ser Pro Lys Asp Ile Val Glu Met Glu Leu Cys Leu Ala Asp Thr Gln
260 265 270 



Pro Ala Val Leu Gly Gly Ala Tyr Asp Glu Phe Ile Phe Ala Pro Arg
275 280 285 



Leu Asp Asn Leu His Ser Cys Phe Cys Ala Leu Gln Ala Leu Ile Asp
290 295 300 



Ser Cys Ala Gly Pro Gly Ser Leu Ala Thr Glu Pro His Val Arg Met 
305 310 315 320 



Val Thr Leu Tyr Asp Asn Glu Glu Val Gly Ser Glu Ser Ala Gln Gly
325 330 335 



Ala Gln Ser Leu Leu Thr Glu Leu Val Leu Arg Arg Ile Ser Ala Ser
340 345 350 

Cys Gln His Pro Thr Ala Phe Glu Glu Ala Ile Pro Lys Ser Phe Met
355 360 365 



Ile Ser Ala Asp Met Ala His Ala Val His Pro Asn Tyr Leu Asp Lys
370 375 380 



His Glu Glu Asn His Arg Pro Leu Phe His Lys Gly Pro Val Ile Lys
385 390 395 400 



Val Ala Lys Ala Pro Thr Leu 
405 



<210> 176 

<211> 31 

<212> PRT 

<213> Homo sapien 

<400> 176 

Met His Leu Ile Thr Asp Asp Glu Ala Pro Tyr Arg Thr Pro Pro Pro
1 5 10 15 



Ser Asn Ala His Val Gln Arg Ser Ile Asn Ala Leu Ile Asp Tyr
20 25 30 

<212> PRT

<400> 177

Gly Thr Ala Asn Cys Asn Pro Val Pro Ala Ser Val Asn Glu Val Ile
1 5 10 15



Phe Ile Phe Pro Ser Ile His Met Phe Ile Tyr Tyr Leu Tyr Gly Cys
20 25 30 



Phe Pro Thr Lys Trp Gln Ser Trp Ala Val Ala Ile Asp Thr Ile Trp
35 40 45 



Ala Ala Lys Pro Lys Ile Phe Ser Ile Trp Pro Phe Thr Glu Lys Val
50 55 60 



Cys Leu Pro Leu Ser 
65 



<210> 178 

<211> 89

<212> PRT 

<213> Homo sapien 

<400> 178 

Thr Ser Thr Ala Gln Cys Ala Gly Ile Arg Leu Ser Ser Gly Ala Arg
1 5 10 15



Ala Gly Thr Val Asn Asn Asp Glu Gly Glu Trp Ser Gly Pro Pro Pro 
20 25 30 



Glu Cys Arg Gly Lys Ser Leu Thr Ser Lys Val Pro Pro Thr Val Gln
35 40 45 



Lys Pro Thr Thr Val Asn Val Pro Thr Thr Glu Val Ser Pro Thr Ser 
50 55 60 



Gln Lys Thr Thr Thr Lys Thr Thr Thr Pro Asn Ala Gln Ala Thr Arg
65 70 75 80



Ser Thr Ser Ala Ala Thr Thr Leu Ser 
85 
<211> 86

<212> PRT 

<213> Homo sapien 

<400> 179 

Tyr Gly Ala Val Cys Gly Ile Arg Phe Arg Ala Ala Pro Gly Ser Thr
1 5 10 15



Val Asn Asn Asp Glu Gly Glu Trp Ser Gly Pro Pro Pro Glu Cys Arg 
20 25 30 



Gly Lys Ser Leu Thr Ser Lys Val Pro Pro Thr Val Gln Lys Pro Thr
35 40 45 



Thr Val Asn Val Pro Thr Thr Glu Val Ser Pro Thr Ser Gln Lys Thr
50 55 60 



Thr Thr Lys Thr Thr Thr Pro Asn Ala Gln Ala Thr Arg Ser Thr Ser
65 70 75 80 



Ala Ala Thr Thr Leu Ser 
85 
Met Leu Lys Thr Ala Ser Ala Ala Leu Thr Trp Gly Leu Gly Pro Asp
1 5 10 15



Pro Ala Trp Ser His Pro Ala Gln Lys Thr Gly Pro Pro Val Pro Val
20 25 30 



Thr His Cys Ser Tyr Gly Ala Arg Gly Phe Trp Cys Trp Gly Pro Pro 
35 40 45 



Cys Arg Trp Gly Arg Arg Arg Ser Arg Ser Cys Arg Glu Asp Gln Lys
50 55 60 



Pro Val Met Asp Asp Gln Arg Asp Leu Ile Ser Asn Asn Glu Gln Leu
65 70 75 80 



Pro Met Leu Gly Arg Arg Pro Gly Ala Pro Glu Ser Lys Cys Ser Arg 
85 90 95 



Gly Ala Leu Tyr Thr Gly Phe Ser Ile Leu Val Thr Leu Leu Leu Ala
100 105 110 



Gly Gln Ala Thr Thr Ala Tyr Phe Leu Tyr Gln Gln Gln Gly Arg Leu
115 120 125 



Asp Lys Leu Thr Val Thr Ser Gln Asn Leu Gln Leu Glu Asn Leu Arg
130 135 140 



Met Lys Leu Pro Lys Pro Pro Lys Pro Val Ser Lys Met Arg Met Ala 
145 150 155 160 



Thr Pro Leu Leu Met Gln Ala Leu Pro Met Gly Ala Leu Pro Gln Gly
165 170 175 



Pro Met Gln Asn Ala Thr Lys Tyr Gly Asn Met Thr Glu Asp His Val
180 185 190 



Met His Leu Leu Gln Asn Ala Asp Pro Leu Lys Val Tyr Pro Pro Leu
195 200 205 



Lys Gly Ser Phe Pro Glu Asn Leu Arg His Leu Lys Asn Thr Met Glu 
210 215 220 



Thr Ile Asp Trp Lys Val Phe Glu Ser Trp Met His His Trp Leu Leu
225 230 235 240



Phe Glu Met Ser Arg His Ser Leu Glu Gln Lys Pro Thr Asp Ala Pro
245 250 255 



Pro Lys Val Leu Thr Lys Cys Gln Glu Glu Val Ser His Ile Pro Ala
260 265 270 



Val His Pro Gly Ser Phe Arg Pro Lys Cys Asp Glu Asn Gly Asn Tyr 
275 280 285 



Leu Pro Leu Gln Cys Tyr Gly Ser Ile Gly Tyr Cys Trp Cys Val Phe
290 295 300 



Pro Asn Gly Thr Glu Val Pro Asn Thr Arg Ser Arg Gly His His Asn 
305 310 315 320 



Cys Ser Gly Lys Gln Trp His Cys Ala Ser Val Arg Gly Pro Gly Arg
325 330 335 



Thr Arg Lys Val Glu Gly Gln Glu Val Pro Ser Glu Ala His Gly Thr
340 345 350 



Arg Thr Pro Arg Met Ala Ala Pro Gly Gly Ser Asp Val Val Met Arg 
355 360 365 



Pro Ala Leu Tyr Pro Ser Thr His Leu Cys Ile Pro Ile Cys Pro Phe
370 375 380 



Ile Ile Ser Pro Leu Thr Cys Thr His Ile Cys Ser Leu Ile Arg Gln
385 390 395 400 



Ser Phe His Leu 
Leu Thr Ala Val Met Val Pro Ala Ala Ser Gly Val Gly Asp Leu Arg
1 5 10 15

Ala Val Gly Gly Arg Arg Ser Arg Ser Cys Arg Glu Asp Gln Lys Pro
20 25 30 



Val Met Asp Asp Gln Arg Asp Leu Ile Ser Asn Asn Glu Gln Leu Pro
35 40 45 



Met Leu Gly Arg Arg Pro Gly Ala Pro Glu Ser Lys Cys Ser Arg Gly 
50 55 60 



Ala Leu Tyr Thr Gly Phe Ser Ile Leu Val Thr Leu Leu Leu Ala Gly
65 70 75 80



Gln Ala Thr Thr Ala Tyr Phe Leu Tyr Gln Gln Gln Gly Arg Leu Asp
85 90 95 



Lys Leu Thr Val Thr Ser Gln Asn Leu Gln Leu Glu Asn Leu Arg Met
100 105 110



Lys Leu Pro Lys Pro Pro Lys Pro Val Ser Lys Met Arg Met Ala Thr 
115 120 125 



Pro Leu Leu Met Gln Ala Leu Pro Met Gly Ala Leu Pro Gln Gly Pro
130 135 140 



Met Gln Asn Ala Thr Lys Tyr Gly Asn Met Thr Glu Asp His Val Met
145 150 155 160 



His Leu Leu Gln Asn Ala Asp Pro Leu Lys Val Tyr Pro Pro Leu Lys
165 170 175 



Gly Ser Phe Pro Glu Asn Leu Arg His Leu Lys Asn Thr Met Glu Thr 
180 185 190 



Ile Asp Trp Lys Val Phe Glu Ser Trp Met His His Trp Leu Leu Phe
195 200 205 



Glu Met Ser Arg His Ser Leu Glu Gln Lys Pro Thr Asp Ala Pro Pro
210 215 220 



Lys Val Leu Thr Lys Cys Gln Glu Glu Val Ser His Ile Pro Ala Val
225 230 235 240 



His Pro Gly Ser Phe Arg Pro Lys Cys Asp Glu Asn Gly Asn Tyr Leu 
245 250 255 



Pro Leu Gln Cys Tyr Gly Ser Ile Gly Tyr Cys Trp Cys Val Phe Pro
260 265 270



Asn Gly Thr Glu Val Pro Asn Thr Arg Ser Arg Gly His His Asn Cys 
275 280 285 



Ser Gly Lys Gln Trp His Cys Ala Ser Val Arg Gly Pro Gly Arg Thr
290 295 300 



Arg Lys Val Glu Gly Gln Glu Val Pro Ser Glu Ala His Gly Thr Arg
305 310 315 320 



Thr Pro Arg Met Ala Ala Pro Gly Gly Ser Asp Val Val Met Arg Pro 
325 330 335 



Ala Leu Tyr Pro Ser Thr His Leu Cys Ile Pro Ile Cys Pro Phe Ile
340 345 350 



Ile Ser Pro Leu Thr Cys Thr His Ile Cys Ser Leu Ile Arg Gln Ser
355 360 365 



Phe His Leu 
370 
<212> PRT

<400> 182

Ile Leu His Gly Pro Leu Gly Gln Gly Ser His Gly Gln Arg Leu His
1 5 10 15



Gln Asn Ala Asp Pro Leu Lys Val Tyr Pro Pro Leu Lys Gly Ser Phe
20 25 30 



Pro Glu Asn Leu Arg His Leu Lys Asn Thr Met Glu Thr Ile Asp Trp
35 40 45 



Lys Val Phe Glu Ser Trp Met His His Trp Leu Leu Phe Glu Met Ser 
50 55 60 



Arg His Ser Leu Glu Gln Lys Pro Thr Asp Ala Pro Pro Lys Glu Ser
65 70 75 80 



Leu Glu Leu Glu Asp Pro Ser Ser Gly Leu Gly Val Thr Lys Gln Asp
85 90 95 
Leu Gly Pro Val Pro Met
100 
Glu Gln Gln Arg Arg Ser Ser Thr Ser Cys Gln Pro His Thr Ala Thr
1 5 10 15



Ala Phe Leu Leu Pro Ser Ala Pro Ser Pro Ser Pro Ile Ser His Pro
20 25 30 

Val Pro His Pro Met Arg Pro Trp Cys Leu Ala Leu Ser Ser Pro Leu 
35 40 45 

Asp Lys Thr Asn Gln Val Gly Thr Ala Asp Asn Asn Ala Ala Arg Pro
50 55 60 



Cys Cys Pro Ile Ser Ile Cys Gln Gln Gly Arg Glu Val Pro Gly Ser
65 70 75 80 



Gly Gln Lys Leu Asp Arg Ser Pro Phe Leu Thr Ser Gln Gln Pro Pro
85 90 95 



Thr Gln Gly Ser Lys Thr
100 



<210> 184 

<211> 273 

<212> PRT 

<213> Homo sapien 

<400> 184 

Met Glu Gly Val Leu Cys Pro Ala Gly Trp Trp Leu Arg Pro Gln Ala
1 5 10 15



Cys Ala Gln Gly Arg Trp Gln His Ser Glu Ala Thr Ser Thr Ala Glu
20 25 30 



Leu Ala Val Glu Ile Val Tyr Val Val Pro Ser Val Ala Glu Ser Leu
35 40 45 



Pro Ala Pro His Trp Leu Ser Thr Gln Gly Phe Gln Asn Leu Glu Lys
50 55 60 
Gly Ala Val Phe Trp Ser Trp Ser Cys Gly Val Pro Cys Ala Ser Met
65 70 75 80 



Gly Trp Val Cys Ala Gln Ala Ala Leu Pro Trp Lys Cys Gly Cys Gly
85 90 95 



Tyr Gly Leu Gly Val Trp Asp Pro Arg Gln Pro Lys Trp Glu Gln Gly
100 105 110 



Arg Pro Val Gly Lys Gly Gly Ser Gly Leu Val Gly Ser Ala Ala Pro 
115 120 125



Arg Cys Pro Phe Ser Val Gln Arg Gly Ser Asp Glu Leu Phe Ser Thr
130 135 140 



Cys Val Thr Asn Gly Pro Phe Ile Met Ser Ser Asn Ser Ala Ser Ala
145 150 155 160 



Ala Asn Gly Asn Asp Ser Lys Lys Phe Lys Gly Asp Ser Arg Ser Ala 
165 170 175 



Gly Val Pro Ser Arg Val Ile His Ile Arg Lys Leu Pro Ile Asp Val
180 185 190 



Thr Glu Gly Glu Val Ile Ser Leu Gly Leu Pro Phe Gly Lys Val Thr
195 200 205 



Asn Leu Leu Met Leu Lys Gly Lys Asn Gln Ala Phe Ile Glu Met Asn
210 215 220 



Thr Glu Glu Ala Ala Asn Thr Met Val Asn Tyr Tyr Thr Ser Val Thr 
225 230 235 240 



Pro Val Leu Arg Gly Gln Pro Ile Tyr Ile Gln Phe Leu Cys Ala Leu
245 250 255 



Ser Thr Ala Pro Ala Ser Cys Pro Gly Gly Ser Pro Arg Ile Pro Phe
260 265 270 



Pro 



<210> 185 
<213> Homo sapien

<400> 185

Glu Pro Pro Ser Pro Thr Leu Ala Glu Tyr Pro Gly Leu Pro Glu Phe 
1 5 10 15



Gly Lys Arg Ser Ser Val Leu Val Val Glu Leu Trp Gly Ala Val Arg 
20 25 30 



Val His Gly Leu Gly Leu Arg Ser Gly Cys Pro Ala Val Glu Val Trp 
35 40 45 



Val Trp Val Trp Val Gly Gly Leu Gly Pro Gln Ala Ala Gln Val Gly
50 55 60 



Ala Gly Pro Ala Gly Gly Glu Gly Arg Leu Trp Pro Gly Gly Lys Cys 
65 70 75 80 



Ser Ser Ala Leu Ser Leu Leu Leu Gln Arg Gly Ser Asp Glu Leu Phe
85 90 95 



Ser Thr Cys Val Thr Asn Gly Pro Phe Ile Met Ser Ser Asn Ser Ala
100 105 110 



Ser Ala Ala Asn Gly Asn Asp Ser Lys Lys Phe Lys Gly Asp Ser Arg 
115 120 125 



Ser Ala Gly Val Pro Ser Arg Val Ile His Ile Arg Lys Leu Pro Ile
130 135 140 



Asp Val Thr Glu Gly Glu Val Ile Ser Leu Gly Leu Pro Phe Gly Lys
145 150 155 160 



Val Thr Asn Leu Leu Met Leu Lys Gly Lys Asn Gln Ala Phe Ile Glu
165 170 175 



Met Asn Thr Glu Glu Ala Ala Asn Thr Met Val Asn Tyr Tyr Thr Ser 
180 185 190 



Val Thr Pro Val Leu Arg Gly Gln Pro Ile Tyr Ile Gln Phe Leu Cys
195 200 205 



Ala Leu Ser Thr Ala Pro Ala Ser Cys Pro Gly Gly Ser Pro Arg Ile
210 215 220 



Pro Phe Pro 
Met Lys Ala Trp Phe Phe Pro Phe Ser Ile Arg Arg Leu Val Thr Phe
1 5 10 15 



Pro Lys Gly Ser Pro Arg Glu Met Thr Ser Pro Ser Val Thr Ser Met
20 25 30 



Gly Leu Phe Arg Arg Leu His Ser Val Pro Arg Gly Ser Ala Leu Cys 
35 40 45 



Ala Met Asp Gly Ile Val Pro Asp Ile Ala Val Gly Thr Lys Arg Gly
50 55 60 



Ser Asp Glu Leu Phe Ser Thr Cys Val Thr Asn Gly Pro Phe Ile Met
65 70 75 80 



Ser Ser Asn Ser Ala Ser Ala Ala Asn Gly Asn Asp Ser Lys Lys Phe 
85 90 95 



Lys Gly Asp Ser Arg Ser Ala Gly Val Pro Ser Arg Val Ile His Ile
100 105 110 



Arg Lys Leu Pro Ile Asp Val Thr Glu Gly Glu Val Ile Ser Leu Gly
115 120 125 



Leu Pro Phe Gly Lys Val Thr Asn Leu Leu Met Leu Lys Gly Lys Asn 
130 135 140 



Gln Val Pro Glu Pro Arg Phe Ser Gly Val Leu Ile Thr Val Gln Ala
145 150 155 160 



Gly Thr Arg Arg Ala Gln Arg Ser Leu Pro Pro Gln Gln Ala Phe Ile
165 170 175 



Glu Met Asn Thr Glu Glu Ala Ala Asn Thr Met Val Asn Tyr Tyr Thr 
180 185 190 



Ser Val Thr Pro Val Leu Arg Gly Gln Pro Ile Tyr Ile Gln Phe Ser
195 200 205 
Asn His Lys Glu Leu Lys Thr Asp Ser Ser Pro Asn Gln Ala Arg Ala
210 215 220 



Gln Ala Ala Leu Gln Ala Val Asn Ser Val Gln Ser Gly Asn Leu Ala
225 230 235 240 



Leu Ala Ala Ser Ala Ala Ala Val Asp Ala Gly Met Ala Met Ala Gly 
245 250 255 



Gln Ser Pro Val Leu Arg Ile Ile Val Glu Asn Leu Phe Tyr Pro Val
260 265 270 



Thr Leu Asp Val Leu His Gln Ile Phe Ser Lys Phe Gly Thr Val Leu
275 280 285 

Lys Ile Ile Thr Phe Thr Lys Asn Asn Gln Phe Gln Ala Leu Leu Gln
290 295 300 



Tyr Ala Asp Pro Val Ser Ala Gln His Ala Lys Leu Ser Leu Asp Gly
305 310 315 320 



Gln Asn Ile Tyr Asn Ala Cys Cys Thr Leu Arg Ile Asp Phe Ser Lys
325 330 335 



Leu Thr Ser Leu Asn Val Lys Tyr Asn Asn Asp Lys Ser Arg Asp Tyr 
340 345 350 



Thr Arg Pro Asp Leu Pro Ser Gly Asp Ser Gln Pro Ser Leu Asp Gln
355 360 365 



Thr Met Ala Ala Ala Phe Ala Ser Pro Tyr Ala Gly Ala Gly Phe Pro 
370 375 380 

Pro Thr Phe Ala Ile Pro Gln Ala Ala Gly Tyr Ser Asn Ala Trp Ser
385 390 395 400 



Trp Phe Pro Ser Asp Cys Met Pro Thr Pro Pro Ser Gln Ala Ala Pro
405 410 415 



His Pro Arg Arg Gln Pro Gly Arg Thr Gly His Ser Ser Ala Arg Ser
420 425 430 



Gly Ala Leu Pro Gly Arg Gly Asp Ala Thr Ser Thr Glu Gln Ala Trp
435 440 445

Pro Gly Gln Trp
450 



<210> 187 

<211> 304 

<212> PRT 

<213> Homo sapien 

<400> 187 



Ala Ala Phe Leu Arg Gly Ala His Thr Val Gln Ala Gly Thr Arg Arg
1 5 10 15



Ala Gln Arg Ser Leu Pro Pro Gln Gln Ala Phe Ile Glu Met Asn Thr
20 25 30 



Glu Glu Ala Ala Asn Thr Met Val Asn Tyr Tyr Thr Ser Val Thr Pro 
35 40 45 



Val Leu Arg Gly Gln Pro Ile Tyr Ile Gln Phe Ser Asn His Lys Glu
50 55 60 



Leu Lys Thr Asp Ser Ser Pro Asn Gln Ala Arg Ala Gln Ala Ala Leu
65 70 75 80 



Gln Ala Val Asn Ser Val Gln Ser Gly Asn Leu Ala Leu Ala Ala Ser
85 90 95 



Ala Ala Ala Val Asp Ala Gly Met Ala Met Ala Gly Gln Ser Pro Val
100 105 110 



Leu Arg Ile Ile Val Glu Asn Leu Phe Tyr Pro Val Thr Leu Asp Val
115 120 125 



Leu His Gln Ile Phe Ser Lys Phe Gly Thr Val Leu Lys Ile Ile Thr
130 135 140 



Phe Thr Lys Asn Asn Gln Phe Gln Ala Leu Leu Gln Tyr Ala Asp Pro
145 150 155 160 



Val Ser Ala Gln His Ala Lys Leu Ser Leu Asp Gly Gln Asn Ile Tyr
165 170 175 



Asn Ala Cys Cys Thr Leu Arg Ile Asp Phe Ser Lys Leu Thr Ser Leu
180 185 190 



Asn Val Lys Tyr Asn Asn Asp Lys Ser Arg Asp Tyr Thr Arg Pro Asp 
195 200 205 
Leu Pro Ser Gly Asp Ser Gln Pro Ser Leu Asp Gln Thr Met Ala Ala
210 215 220 



Ala Phe Ala Ser Pro Tyr Ala Gly Ala Gly Phe Pro Pro Thr Phe Ala 
225 230 235 240 



Ile Pro Gln Ala Ala Gly Tyr Ser Asn Ala Trp Ser Trp Phe Pro Ser
245 250 255 



Asp Cys Met Pro Thr Pro Pro Ser Gln Ala Ala Pro His Pro Arg Arg
260 265 270 



Gln Pro Gly Arg Thr Gly His Ser Ser Ala Arg Ser Gly Ala Leu Pro
275 280 285 



Gly Arg Gly Asp Ala Thr Ser Thr Glu Gln Ala Trp Pro Gly Gln Trp
290 295 300 



<210> 188 

<211> 606 

<212> PRT 

<213> Homo sapien 

<400> 188 

Met Lys Ala Trp Phe Phe Pro Phe Ser Ile Arg Arg Leu Val Thr Phe
1 5 10 15



Pro Lys Gly Ser Pro Arg Glu Met Thr Ser Pro Ser Val Thr Ser Met 
20 25 30 



Gly Leu Phe Arg Arg Leu His Ser Val Pro Arg Gly Ser Ala Leu Cys 
35 40 45 



Ala Met Asp Gly Ile Val Pro Asp Ile Ala Val Gly Thr Lys Arg Gly
50 55 60 



Ser Asp Glu Leu Phe Ser Thr Cys Val Thr Asn Gly Pro Phe Ile Met
65 70 75 80 



Ser Ser Asn Ser Ala Ser Ala Ala Asn Gly Asn Asp Ser Lys Lys Phe 
85 90 95 



Lys Gly Asp Ser Arg Ser Ala Gly Val Pro Ser Arg Val Ile His Ile
100 105 110 
Arg Lys Leu Pro Ile Asp Val Thr Glu Gly Glu Val Ile Ser Leu Gly
115 120 125 



Leu Pro Phe Gly Lys Val Thr Asn Leu Leu Met Leu Lys Gly Lys Asn 
130 135 140 



Gln Val Pro Glu Pro Arg Phe Ser Gly Val Leu Thr Pro Cys Arg Arg
145 150 155 160 



Gly Arg Gly Gly Pro Ser Ala His Cys Leu Pro Asn Glu Ala Phe Ile
165 170 175 



Glu Met Asn Thr Glu Glu Ala Ala Asn Thr Met Val Asn Tyr Tyr Thr 
180 185 190 



Ser Val Thr Pro Val Leu Arg Gly Gln Pro Ile Tyr Ile Gln Phe Ser
195 200 205 



Asn His Lys Glu Leu Lys Thr Asp Ser Ser Pro Asn Gln Ala Arg Ala
210 215 220 



Gln Ala Ala Leu Gln Ala Val Asn Ser Val Gln Ser Gly Asn Leu Ala
225 230 235 240 



Leu Ala Ala Ser Ala Ala Ala Val Asp Ala Gly Met Ala Met Ala Gly 
245 250 255 



Gln Ser Pro Val Leu Arg Ile Ile Val Glu Asn Leu Phe Tyr Pro Val
260 265 270 



Thr Leu Asp Val Leu His Gln Ile Phe Ser Lys Phe Gly Thr Val Leu
275 280 285 



Lys Ile Ile Thr Phe Thr Lys Asn Asn Gln Phe Gln Ala Leu Leu Gln
290 295 300 



Tyr Ala Asp Pro Val Ser Ala Gln His Ala Lys Leu Ser Leu Asp Gly
305 310 315 320 



Gln Asn Ile Tyr Asn Ala Cys Cys Thr Leu Arg Ile Asp Phe Ser Lys
325 330 335 



Leu Thr Ser Leu Asn Val Lys Tyr Asn Asn Asp Lys Ser Arg Asp Tyr 
340 345 350 



Thr Arg Pro Asp Leu Pro Ser Gly Asp Ser Gln Pro Ser Leu Asp Gln
355 360 365



Thr Met Ala Ala Ala Phe Gly Tyr Arg Gly Cys Pro Thr Arg Arg Gln
370 375 380 



Cys Ala Glu Trp Ser Ile Arg Ala Ala Gln Ser Gly Ala Pro Ala Ala
385 390 395 400 



Arg Ala Arg Pro Cys Thr Arg Val Met His Leu Leu Leu Ser Ala Arg 
405 410 415 



Pro Ala Gln His Gly Pro Val Ser Pro Thr Ser Gly Pro Arg Pro Pro
420 425 430 



Leu Glu Gln Arg Ile Cys Pro Arg Arg Leu Cys Arg Gly Arg Pro Pro
435 440 445 



Ala Gly Leu Gly Pro Ile Pro Gln His Ser Ala Arg Ser Leu Val Glu
450 455 460 



Val Val Gly Ala Met Ile Ser Val Ser Phe Ile Ser Arg Cys Thr Trp
465 470 475 480 



Tyr Asn Leu Ser Leu Ser Val Cys Arg Ser Trp Phe Pro Ser His Leu 
485 490 495 



Cys His Ser Ser Ser Cys Arg Val Phe Lys Arg Leu Val Leu Val Pro 
500 505 510 



Gln Arg Leu His Ala His Thr Thr Phe Pro Gly Ser Ser Ala Ser Thr
515 520 525 



Ala Ala Ala Trp Ala Asp Trp Ala Leu Glu Cys Gln Val Arg Gly Pro
530 535 540 



Ser Arg Glu Arg Gly Arg His Val His Arg Ala Gly Leu Ala Arg Ala 
545 550 555 560 



Val Val Gly Gln Gly Cys Gly Gly Pro Arg Val Asp Gly Ala Ser Gln
565 570 575 



Lys Pro Leu Ile Ala Gln Ser Leu Trp Ser Gly Ser Leu Gln Ala Pro
580 585 590 



Pro Gly Glu Arg Glu Asp Arg Gln Trp Leu Glu Gly Cys Leu
595 600 605 
<210> 189

<211> 227 

<212> PRT 

<213> Homo sapien 

<400> 189 

Ala Ala Phe Leu Arg Gly Ala His Thr Val Gln Ala Gly Thr Arg Arg
1 5 10 15



Ala Gln Arg Ser Leu Pro Pro Gln Gln Ala Phe Ile Glu Met Asn Thr
20 25 30 



Glu Glu Ala Ala Asn Thr Met Val Asn Tyr Tyr Thr Ser Val Thr Pro 
35 40 45 



Val Leu Arg Gly Gln Pro Ile Tyr Ile Gln Phe Ser Asn His Lys Glu
50 55 60 



Leu Lys Thr Asp Ser Ser Pro Asn Gln Ala Arg Ala Gln Ala Ala Leu
65 70 75 80 



Gln Ala Val Asn Ser Val Gln Ser Gly Asn Leu Ala Leu Ala Ala Ser
85 90 95 



Ala Ala Ala Val Asp Ala Gly Met Ala Met Ala Gly Gln Ser Pro Val
100 105 110 



Leu Arg Ile Ile Val Glu Asn Leu Phe Tyr Pro Val Thr Leu Asp Val
115 120 125 



Leu His Gln Ile Phe Ser Lys Phe Gly Thr Val Leu Lys Ile Ile Thr
130 135 140 



Phe Thr Lys Asn Asn Gln Phe Gln Ala Leu Leu Gln Tyr Ala Asp Pro
145 150 155 160 



Val Ser Ala Gln His Ala Lys Leu Ser Leu Asp Gly Gln Asn Ile Tyr
165 170 175 



Asn Ala Cys Cys Thr Leu Arg Ile Asp Phe Ser Lys Leu Thr Ser Leu
180 185 190 



Asn Val Lys Tyr Asn Asn Asp Lys Ser Arg Asp Tyr Thr Arg Pro Asp 
195 200 205 
Leu Pro Ser Gly Asp Ser Gln Pro Ser Leu Asp Gln Thr Met Ala Ala
210 215 220 



Ala Phe Gly
225




<210> 190

<211> 432

<212> PRT

<213> Homo sapien

<400> 190



Met Lys Ala Trp Phe Phe Pro Phe Ser Ile Arg Arg Leu Val Thr Phe
1 5 10 15



Pro Lys Gly Ser Pro Arg Glu Met Thr Ser Pro Ser Val Thr Ser Met 
20 25 30 



Gly Leu Phe Arg Arg Leu His Ser Val Pro Arg Gly Ser Ala Leu Cys 
35 40 45 



Ala Met Asp Gly Ile Val Pro Asp Ile Ala Val Gly Thr Lys Arg Gly
50 55 60 



Ser Asp Glu Leu Phe Ser Thr Cys Val Thr Asn Gly Pro Phe Ile Met
65 70 75 80 



Ser Ser Asn Ser Ala Ser Ala Ala Asn Gly Asn Asp Ser Lys Lys Phe 
85 90 95 



Lys Gly Asp Ser Arg Ser Ala Gly Val Pro Ser Arg Val Ile His Ile
100 105 110 



Arg Lys Leu Pro Ile Asp Val Thr Glu Gly Glu Val Ile Ser Leu Gly
115 120 125 



Leu Pro Phe Gly Lys Val Thr Asn Leu Leu Met Leu Lys Gly Lys Asn 
130 135 140 



Gln Val Pro Glu Pro Arg Phe Ser Gly Val Leu Ile Thr Val Gln Ala
145 150 155 160 



Gly Thr Arg Arg Ala Gln Arg Ser Leu Pro Pro Gln Gln Ala Phe Ile
165 170 175 



Glu Met Asn Thr Glu Glu Ala Ala Asn Thr Met Val Asn Tyr Tyr Thr 
180 185 190 
Ser Val Thr Pro Val Leu Arg Gly Gln Pro Ile Tyr Ile Gln Phe Ser
195 200 205 

Asn His Lys Glu Leu Lys Thr Asp Ser Ser Pro Asn Gln Ala Arg Ala
210 215 220 

Gln Ala Ala Leu Gln Ala Val Asn Ser Val Gln Ser Gly Asn Leu Ala
225 230 235 240 

Leu Ala Ala Ser Ala Ala Ala Val Asp Ala Gly Met Ala Met Ala Gly 
245 250 255 

Gln Ser Pro Val Leu Arg Ile Ile Val Glu Asn Leu Phe Tyr Pro Val
260 265 270 

Thr Leu Asp Val Leu His Gln Ile Phe Ser Lys Phe Gly Thr Val Leu
275 280 285 



Lys Ile Ile Thr Phe Thr Lys Asn Asn Gln Phe Gln Ala Leu Leu Gln
290 295 300 



Tyr Ala Asp Pro Val Ser Ala Gln His Ala Lys Leu Ser Leu Asp Gly
305 310 315 320 



Gln Asn Ile Tyr Asn Ala Cys Cys Thr Leu Arg Ile Asp Phe Ser Lys
325 330 335 



Leu Thr Ser Leu Asn Val Lys Tyr Asn Asn Asp Lys Ser Arg Asp Tyr 
340 345 350



Thr Arg Pro Asp Leu Pro Ser Gly Asp Ser Gln Pro Ser Leu Asp Gln
355 360 365 



Thr Met Ala Ala Ala Phe Gly Leu Ser Val Pro Asn Val His Gly Ala 
370 375 380 



Leu Ala Pro Leu Ala Ile Pro Ser Ala Ala Ala Ala Ala Ala Ala Ala
385 390 395 400 



Gly Arg Ile Ala Ile Pro Gly Leu Ala Gly Ala Gly Asn Ser Val Leu
405 410 415 



Leu Val Ser Asn Leu Asn Pro Glu Ala Ser Thr Val Thr Cys Ser Ala 
420 425 430 
<210> 191

<211> 284 

<212> PRT 

<213> Homo sapien 

<400> 191 

Ala Ala Phe Leu Arg Gly Ala His Thr Val Gln Ala Gly Thr Arg Arg
1 5 10 15 



Ala Gln Arg Ser Leu Pro Pro Gln Gln Ala Phe Ile Glu Met Asn Thr
20 25 30 



Glu Glu Ala Ala Asn Thr Met Val Asn Tyr Tyr Thr Ser Val Thr Pro 
35 40 45 



Val Leu Arg Gly Gln Pro Ile Tyr Ile Gln Phe Ser Asn His Lys Glu
50 55 60 



Leu Lys Thr Asp Ser Ser Pro Asn Gln Ala Arg Ala Gln Ala Ala Leu
65 70 75 80 



Gln Ala Val Asn Ser Val Gln Ser Gly Asn Leu Ala Leu Ala Ala Ser
85 90 95 



Ala Ala Ala Val Asp Ala Gly Met Ala Met Ala Gly Gln Ser Pro Val
100 105 110 



Leu Arg Ile Ile Val Glu Asn Leu Phe Tyr Pro Val Thr Leu Asp Val
115 120 125 



Leu His Gln Ile Phe Ser Lys Phe Gly Thr Val Leu Lys Ile Ile Thr
130 135 140



Phe Thr Lys Asn Asn Gln Phe Gln Ala Leu Leu Gln Tyr Ala Asp Pro
145 150 155 160 



Val Ser Ala Gln His Ala Lys Leu Ser Leu Asp Gly Gln Asn Ile Tyr
165 170 175 



Asn Ala Cys Cys Thr Leu Arg Ile Asp Phe Ser Lys Leu Thr Ser Leu
180 185 190 



Asn Val Lys Tyr Asn Asn Asp Lys Ser Arg Asp Tyr Thr Arg Pro Asp 
195 200 205 



Leu Pro Ser Gly Asp Ser Gln Pro Ser Leu Asp Gln Thr Met Ala Ala
210 215 220



Ala Phe Gly Leu Ser Val Pro Asn Val His Gly Ala Leu Ala Pro Leu 
225 230 235 240 



Ala Ile Pro Ser Ala Ala Ala Ala Ala Ala Ala Ala Gly Arg Ile Ala
245 250 255

Ile Pro Gly Leu Ala Gly Ala Gly Asn Ser Val Leu Leu Val Ser Asn
260 265 270



Leu Asn Pro Glu Ala Ser Thr Val Thr Cys Ser Ala
275 280


<210> 192

<211> 304

<212> PRT

<213> Homo sapien

<400> 192



Ala Ala Phe Leu Arg Gly Ala His Thr Val Gln Ala Gly Thr Arg Arg
1 5 10 15 



Ala Gln Arg Ser Leu Pro Pro Gln Gln Ala Phe Ile Glu Met Asn Thr
20 25 30 



Glu Glu Ala Ala Asn Thr Met Val Asn Tyr Tyr Thr Ser Val Thr Pro 
35 40 45



Val Leu Arg Gly Gln Pro Ile Tyr Ile Gln Phe Ser Asn His Lys Glu
50 55 60



Leu Lys Thr Asp Ser Ser Pro Asn Gln Ala Arg Ala Gln Ala Ala Leu
65 70 75 80 



Gln Ala Val Asn Ser Val Gln Ser Gly Asn Leu Ala Leu Ala Ala Ser
85 90 95 



Ala Ala Ala Val Asp Ala Gly Met Ala Met Ala Gly Gln Ser Pro Val
100 105 110



Leu Arg Ile Ile Val Glu Asn Leu Phe Tyr Pro Val Thr Leu Asp Val
115 120 125 



Leu His Gln Ile Phe Ser Lys Phe Gly Thr Val Leu Lys Ile Ile Thr
130 135 140 
Phe Thr Lys Asn Asn Gln Phe Gln Ala Leu Leu Gln Tyr Ala Asp Pro
145 150 155 160



Val Ser Ala Gln His Ala Lys Leu Ser Leu Asp Gly Gln Asn Ile Tyr
165 170 175 



Asn Ala Cys Cys Thr Leu Arg Ile Asp Phe Ser Lys Leu Thr Ser Leu
180 185 190 



Asn Val Lys Tyr Asn Asn Asp Lys Ser Arg Asp Tyr Thr Arg Pro Asp 
195 200 205 



Leu Pro Ser Gly Asp Ser Gln Pro Ser Leu Asp Gln Thr Met Ala Ala
210 215 220 



Ala Phe Ala Ser Pro Tyr Ala Gly Ala Gly Phe Pro Pro Thr Phe Ala 
225 230 235 240 



Ile Pro Gln Ala Ala Gly Tyr Ser Asn Ala Trp Ser Trp Phe Pro Ser
245 250 255 



Asp Cys Met Pro Thr Pro Pro Ser Gln Ala Ala Pro His Pro Arg Arg
260 265 270 



Gln Pro Gly Arg Thr Gly His Ser Ser Ala Arg Ser Gly Ala Leu Pro
275 280 285 



Gly Arg Gly Asp Ala Thr Ser Thr Glu Gln Ala Trp Pro Gly Gln Trp
290 295 300 



<210> 193 

<211> 304 

<212> PRT 

<213> Homo sapien 

<400> 193 

Ala Ala Phe Leu Arg Gly Ala His Thr Val Gln Ala Gly Thr Arg Arg
1 5 10 15 



Ala Gln Arg Ser Leu Pro Pro Gln Gln Ala Phe Ile Glu Met Asn Thr
20 25 30 



Glu Glu Ala Ala Asn Thr Met Val Asn Tyr Tyr Thr Ser Val Thr Pro 
35 40 45 



Val Leu Arg Gly Gln Pro Ile Tyr Ile Gln Phe Ser Asn His Lys Glu
50 55 60



Leu Lys Thr Asp Ser Ser Pro Asn Gln Ala Arg Ala Gln Ala Ala Leu
65 70 75 80 



Gln Ala Val Asn Ser Val Gln Ser Gly Asn Leu Ala Leu Ala Ala Ser
85 90 95 



Ala Ala Ala Val Asp Ala Gly Met Ala Met Ala Gly Gln Ser Pro Val
100 105 110 



Leu Arg Ile Ile Val Glu Asn Leu Phe Tyr Pro Val Thr Leu Asp Val
115 120 125 



Leu His Gln Ile Phe Ser Lys Phe Gly Thr Val Leu Lys Ile Ile Thr
130 135 140 



Phe Thr Lys Asn Asn Gln Phe Gln Ala Leu Leu Gln Tyr Ala Asp Pro
145 150 155 160 



Val Ser Ala Gln His Ala Lys Leu Ser Leu Asp Gly Gln Asn Ile Tyr
165 170 175 



Asn Ala Cys Cys Thr Leu Arg Ile Asp Phe Ser Lys Leu Thr Ser Leu
180 185 190 



Asn Val Lys Tyr Asn Asn Asp Lys Ser Arg Asp Tyr Thr Arg Pro Asp 
195 200 205 



Leu Pro Ser Gly Asp Ser Gln Pro Ser Leu Asp Gln Thr Met Ala Ala
210 215 220 



Ala Phe Ala Ser Pro Tyr Ala Gly Ala Gly Phe Pro Pro Thr Phe Ala 
225 230 235 240 



Ile Pro Gln Ala Ala Gly Tyr Ser Asn Ala Trp Ser Trp Phe Pro Ser
245 250 255 



Asp Cys Met Pro Thr Pro Pro Ser Gln Ala Ala Pro His Pro Arg Arg
260 265 270 



Gln Pro Gly Arg Thr Gly His Ser Ser Ala Arg Ser Gly Ala Leu Pro
275 280 285 



Gly Arg Gly Asp Ala Thr Ser Thr Glu Gln Ala Trp Pro Gly Gln Trp
290 295 300 
<210> 194

<211> 304 

<212> PRT 

<213> Homo sapien 

<400> 194 

Ala Ala Phe Leu Arg Gly Ala His Thr Val Gln Ala Gly Thr Arg Arg
1 5 10 15



Ala Gln Arg Ser Leu Pro Pro Gln Gln Ala Phe Ile Glu Met Asn Thr
20 25 30 



Glu Glu Ala Ala Asn Thr Met Val Asn Tyr Tyr Thr Ser Val Thr Pro 
35 40 45 



Val Leu Arg Gly Gln Pro Ile Tyr Ile Gln Phe Ser Asn His Lys Glu
50 55 60 



Leu Lys Thr Asp Ser Ser Pro Asn Gln Ala Arg Ala Gln Ala Ala Leu
65 70 75 80



Gln Ala Val Asn Ser Val Gln Ser Gly Asn Leu Ala Leu Ala Ala Ser
85 90 95 



Ala Ala Ala Val Asp Ala Gly Met Ala Met Ala Gly Gln Ser Pro Val
100 105 110



Leu Arg Ile Ile Val Glu Asn Leu Phe Tyr Pro Val Thr Leu Asp Val
115 120 125 



Leu His Gln Ile Phe Ser Lys Phe Gly Thr Val Leu Lys Ile Ile Thr
130 135 140 



Phe Thr Lys Asn Asn Gln Phe Gln Ala Leu Leu Gln Tyr Ala Asp Pro
145 150 155 160



Val Ser Ala Gln His Ala Lys Leu Ser Leu Asp Gly Gln Asn Ile Tyr
165 170 175 



Asn Ala Cys Cys Thr Leu Arg Ile Asp Phe Ser Lys Leu Thr Ser Leu
180 185 190 



Asn Val Lys Tyr Asn Asn Asp Lys Ser Arg Asp Tyr Thr Arg Pro Asp 
195 200 205 
Leu Pro Ser Gly Asp Ser Gln Pro Ser Leu Asp Gln Thr Met Ala Ala
210 215 220 



Ala Phe Ala Ser Pro Tyr Ala Gly Ala Gly Phe Pro Pro Thr Phe Ala 
225 230 235 240 



Ile Pro Gln Ala Ala Gly Tyr Ser Asn Ala Trp Ser Trp Phe Pro Ser
245 250 255 



Asp Cys Met Pro Thr Pro Pro Ser Gln Ala Ala Pro His Pro Arg Arg
260 265 270 



Gln Pro Gly Arg Thr Gly His Ser Ser Ala Arg Ser Gly Ala Leu Pro
275 280 285



Gly Arg Gly Asp Ala Thr Ser Thr Glu Gln Ala Trp Pro Gly Gln Trp
290 295 300 



<210> 195 

<211> 315 

<212> PRT 

<213> Homo sapien 

<400> 195 

Met Arg Ser Val Gln Ala Gly Leu Ser Ser Gln Glu Ser Leu Ser Pro
1 5 10 15



Val Leu Ser Leu Ser Pro Asp Ser Met Ser Phe Thr Thr Arg Ser Thr 
20 25 30 



Phe Ser Thr Asn Tyr Arg Ser Leu Gly Ser Val Gln Ala Pro Ser Tyr
35 40 45 



Gly Ala Arg Pro Val Ser Ser Ala Ala Ser Val Tyr Ala Gly Ala Gly 
50 55 60 



Gly Ser Gly Ser Arg Ile Ser Val Ser Arg Ser Thr Ser Phe Arg Gly
65 70 75 80 



Gly Met Gly Ser Gly Gly Leu Ala Thr Gly Ile Ala Gly Gly Leu Ala
85 90 95 



Gly Met Gly Gly Ile Gln Asn Glu Lys Glu Thr Met Gln Ser Leu Asn
100 105 110 



Asp Arg Leu Ala Ser Tyr Leu Asp Arg Val Arg Ser Leu Glu Thr Glu 
115 120 125

Asn Arg Arg Leu Glu Ser Lys Ile Arg Glu His Leu Glu Lys Lys Gly
130 135 140 



Pro Gln Val Arg Asp Trp Ser His Tyr Phe Lys Ile Ile Glu Asp Leu
145 150 155 160



Arg Ala Gln Ile Phe Ala Asn Thr Val Asp Asn Ala Arg Ile Val Leu
165 170 175 



Gln Ile Asp Asn Ala Arg Leu Ala Ala Asp Asp Phe Arg Val Lys Tyr
180 185 190 



Glu Thr Glu Leu Ala Met Arg Gln Ser Val Glu Asn Asp Ile His Gly
195 200 205



Leu Arg Lys Val Ile Asp Asp Thr Asn Ile Thr Arg Leu Gln Leu Glu
210 215 220 



Thr Glu Ile Glu Ala Leu Lys Glu Glu Leu Leu Phe Met Lys Lys Asn
225 230 235 240



His Glu Glu Asp Arg Thr Ala Ala Thr Ser Pro Asp Ala Ser Ser Gly 
245 250 255 



Leu Pro Arg Arg Gln Asn Ala Pro Asn Thr Gly Thr Gly Arg Pro Gly
260 265 270 



Asp Ser Ala Arg Asp Glu Trp Ser Arg Pro Arg Gly Glu Gly Arg Gln
275 280 285 



Asn Gly Glu Pro Arg Pro Lys Arg Ser Val Asn Glu Pro Val Glu Arg 
290 295 300 



Pro Gly Ser Asp Asp Ser Gly His Gly Gly Thr 
305 310 315 



<210> 196 

<211> 296 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE

<222> (245)..(245)

<223> X=any amino acid 
<220>

<221> MISC_FEATURE 

<222> (250)..(251)

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (253)..(254)

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE

<222> (257)..(258)

<223> X=any amino acid 



<400> 196 



His Arg Ser Val Gln Ala Gly Leu Ser Ser Gln Glu Ser Leu Ser Pro
1 5 10 15



Val Leu Ser Leu Ser Pro Asp Ser Met Ser Phe Thr Thr Arg Ser Thr 
20 25 30 



Phe Ser Thr Asn Tyr Arg Ser Leu Gly Ser Val Gln Ala Pro Ser Tyr
35 40 45 



Gly Ala Arg Pro Val Ser Ser Ala Ala Ser Val Tyr Ala Gly Ala Gly 
50 55 60 



Gly Ser Gly Ser Arg Ile Ser Val Ser Arg Ser Thr Ser Phe Arg Gly
65 70 75 80 



Gly Met Gly Ser Gly Gly Leu Ala Thr Gly Ile Ala Gly Gly Leu Ala
85 90 95



Gly Met Gly Gly Ile Gln Asn Glu Lys Glu Thr Met Gln Ser Leu Asn
100 105 110 



Asp Arg Leu Ala Ser Tyr Leu Asp Arg Val Arg Ser Leu Glu Thr Glu 
115 120 125



Asn Arg Arg Leu Glu Ser Lys Ile Arg Glu His Leu Glu Lys Lys Gly
130 135 140 



Pro Gln Val Arg Asp Trp Ser His Tyr Phe Lys Ile Ile Glu Asp Leu
145 150 155 160 



Arg Ala Gln Ile Phe Ala Asn Thr Val Asp Asn Ala Arg Ile Val Leu
165 170 175



Gln Ile Asp Asn Ala Arg Leu Ala Ala Asp Asp Phe Arg Val Lys Tyr
180 185 190 



Glu Thr Glu Leu Ala Met Arg Gln Ser Val Glu Asn Asp Ile His Gly
195 200 205 



Leu Arg Lys Val Ile Asp Asp Thr Asn Ile Thr Arg Leu Gln Leu Glu
210 215 220 



Thr Glu Ile Glu Ala Leu Lys Glu Glu Leu Leu Phe Met Lys Lys Asn
225 230 235 240 



His Glu Glu Asp Xaa Thr Gly Leu Gln Xaa Xaa Met Xaa Xaa Leu Gly
245 250 255 



Xaa Xaa Glu Glu Met Pro Pro Thr Pro Gly Leu Ala Asp Gln Ala Thr
260 265 270 



Arg Gln Glu Thr Ser Gly Arg Asp Arg Gly Glu Lys Val Ala Arg Met
275 280 285 



Gly Ser His Gly Pro Ser Gly Val 
290 295 



<210> 197 

<211> 267 

<212> PRT 

<213> Homo sapien 

<400> 197 

Met Ser Phe Thr Thr Arg Ser Thr Phe Ser Thr Asn Tyr Arg Ser Leu 
1 5 10 15 



Gly Ser Val Gln Ala Pro Ser Tyr Gly Ala Arg Pro Val Ser Ser Ala
20 25 30 



Ala Ser Val Tyr Ala Gly Ala Gly Gly Ser Gly Ser Arg Ile Ser Val
35 40 45 



Ser Arg Ser Thr Ser Phe Arg Gly Gly Met Gly Ser Gly Gly Leu Ala 
50 55 60



Thr Gly Ile Ala Gly Gly Leu Ala Gly Met Gly Gly Ile Gln Asn Glu
65 70 75 80 
Lys Glu Thr Met Gln Ser Leu Asn Asp Arg Leu Ala Ser Tyr Leu Asp
85 90 95 



Arg Val Arg Ser Leu Glu Thr Glu Asn Arg Arg Leu Glu Ser Lys Ile
100 105 110



Arg Glu His Leu Glu Lys Lys Gly Pro Gln Val Arg Asp Trp Ser His
115 120 125



Tyr Phe Lys Ile Ile Glu Asp Leu Arg Ala Gln Ile Phe Ala Asn Thr
130 135 140 



Val Asp Asn Ala Arg Ile Val Leu Gln Ile Asp Asn Ala Arg Leu Ala
145 150 155 160



Ala Asp Asp Phe Arg Val Lys Tyr Glu Thr Glu Leu Ala Met Arg Gln
165 170 175 



Ser Val Glu Asn Asp Ile His Gly Leu Arg Lys Val Ile Asp Asp Thr
180 185 190 



Asn Ile Thr Arg Leu Gln Leu Glu Thr Glu Ile Glu Ala Leu Lys Glu
195 200 205 



Glu Leu Leu Phe Met Lys Lys Asn His Glu Glu Glu Val Lys Gly Leu 
210 215 220 



Gln Ala Gln Ile Ala Ser Ser Gly Val Asp Arg Gly Gly Arg Cys Pro
225 230 235 240



Gln Ile Ser Gly Pro Arg Gln Asp His Gly Arg His Ser Gly Pro Asn
245 250 255 



Met Thr Ser Trp Leu Gly Arg Thr Glu Arg Ser 
260 265 



<210> 198 

<211> 454 

<212> PRT 

<213> Homo sapien 

<400> 198 

Lys Ala Ala Ser Arg Ala Asn Asn Thr Cys Cys Pro Cys Pro Cys Pro 
1 5 10 15



Val Gly His Pro Val Ser Gly Gly Met Ser Phe Thr Thr Arg Ser Thr 
20 25 30



Phe Ser Thr Asn Tyr Arg Ser Leu Gly Ser Val Gln Ala Pro Ser Tyr
35 40 45 



Gly Ala Arg Pro Val Ser Ser Ala Ala Ser Val Tyr Ala Gly Ala Gly 
50 55 60 



Gly Ser Gly Ser Arg Ile Ser Val Ser Arg Ser Thr Ser Phe Arg Gly
65 70 75 80 



Gly Met Gly Ser Gly Gly Leu Ala Thr Gly Ile Ala Gly Gly Leu Ala
85 90 95 



Gly Met Gly Gly He Gin Asn Glu Lys Glu Thr Met Gin Ser Leu Asn 
100 105 110 



Asp Arg Leu Ala Ser Tyr Leu Asp Arg Val Arg Ser Leu Glu Thr Glu 
115 120 125 



Asn Arg Arg Leu Glu Ser Lys He Arg Glu His Leu Glu Lys Lys Gly 
130 135 140 



Pro Gin Val Arg Asp Trp Ser His Tyr Phe Lys He He Glu Asp Leu 
145 150 155 160 



Arg Ala Gin He Phe Ala Asn Thr Val Asp Asn Ala Arg He Val Leu 
165 170 175 



Gin He Asp Asn Ala Arg Leu Ala Ala Asp Asp Phe Arg Val Lys Tyr 
180 185 190 



Glu Thr Glu Leu Ala Met Arg Gin Ser Val Glu Asn Asp He His Gly 
195 200 205 



Leu Arg Lys Val He Asp Asp Thr Asn He Thr Arg Leu Gin Leu Glu 
210 215 220 



Thr Glu He Glu Ala Leu Lys Glu Glu Leu Leu Phe Met Lys Lys Asn 
225 230 235 240 



His Glu Glu Glu Val Lys Gly Leu Gin Ala Gin He Ala Ser Ser Gly 
245 250 255 



Leu Thr Val Glu Val Asp Ala Pro Lys Ser Gin Asp Leu Ala Lys lie 
260 265 270 
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Met Ala Asp He Arg Ala Gin Tyr Asp Glu Leu Ala Arg Lys Asn Arg 
275 280 285 



Glu Glu Leu Asp Lys Tyr Trp Ser Gln Gln Ile Glu Glu Ser Thr Thr
290 295 300

Val Val Thr Thr Gln Ser Ala Glu Val Gly Ala Ala Glu Thr Thr Leu
305 310 315 320

Thr Glu Leu Arg Arg Thr Val Gln Ser Leu Glu Ile Asp Leu Asp Ser
325 330 335

Met Arg Asn Leu Lys Ala Ser Leu Glu Asn Ser Leu Arg Glu Val Glu
340 345 350

Ala Arg Tyr Ala Leu Gln Met Glu Gln Leu Asn Gly Ile Leu Leu His
355 360 365

Leu Glu Ser Glu Leu Ala Gln Thr Arg Ala Glu Gly Gln Arg Gln Ala
370 375 380

Gln Glu Tyr Glu Ala Leu Leu Asn Ile Lys Val Lys Leu Glu Ala Glu
385 390 395 400

Ile Ala Thr Tyr Arg Arg Leu Leu Glu Asp Gly Glu Asp Phe Asn Leu
405 410 415

Gly Asp Ala Leu Asp Ser Ser Asn Ser Met Gln Thr Ile Gln Lys Thr
420 425 430

Thr Thr Arg Arg Ile Val Asp Gly Lys Val Val Ser Glu Thr Asn Asp
435 440 445



Thr Lys Val Leu Arg His 
450 



<210> 199 

<211> 104 

<212> PRT 

<213> Homo sapien 

<400> 199 

Met Leu Glu Thr Lys Trp Ser Leu Leu Gln Gln Gln Lys Thr Ala Arg
1 5 10 15

Ser Asn Met Asp Asn Met Phe Glu Ser Tyr Ile Asn Asn Leu Arg Arg
20 25 30

Gln Leu Glu Thr Leu Gly Gln Glu Lys Leu Lys Leu Glu Ala Glu Leu
35 40 45

Gly Asn Met Gln Gly Leu Val Glu Asp Phe Lys Asn Lys Ser Glu Gln
50 55 60

Leu Pro Pro Ser Thr Gln Leu Lys Ser Pro Cys Ser Pro Pro Pro Leu
65 70 75 80

Gly Thr Leu Gly Leu Ser Pro His Gly Pro Leu Leu Trp Ala Val Gln
85 90 95

Ser Tyr Leu Ile Arg Val Thr Thr
100 



<210> 200 

<211> 768 

<212> PRT 

<213> Homo sapien 

<400> 200 

Met Gly Gly Ala Glu Arg Gly Arg Ala Pro Ala Phe Leu Leu Arg Ser
1 5 10 15

Ala Pro Val Ser Ala Gly Gly Gly Gly Ala Tyr Ile Thr Cys Ala Ala
20 25 30

Pro Leu Leu Arg Glu Asp Val Ala Cys Ser Leu Ala Pro Gly Glu Ser
35 40 45

Pro Arg Leu Pro Arg Glu Leu Gly Glu Arg Asn Trp Arg Arg Ser Thr
50 55 60

Pro Gly Gly Ser Ala Leu Gln His Glu Pro Ile Ser Ser Arg Ser Arg
65 70 75 80

Ser Arg Arg Pro Gly Arg Gly Ala Ala Ser Glu Asp Glu Asn Gly Asp
85 90 95

Asp Gln Gly Phe Gln Glu Gly Glu Asp Glu Leu Gly Asp Glu Glu Glu
100 105 110

Gly Ala Gly Asp Glu Asn Gly His Gly Glu Gln Gln Pro Gln Pro Pro
115 120 125

Ala Thr Gln Gln Gln Gln Pro Gln Gln Gln Arg Gly Ala Ala Lys Glu
130 135 140



Ala Ala Gly Lys Ser Ser Gly Pro Thr Ser Leu Phe Ala Val Thr Val
145 150 155 160

Ala Pro Pro Gly Ala Arg Gln Gly Gln Gln Gln Ala Gly Gly Asp Gly
165 170 175

Lys Thr Glu Gln Lys Gly Gly Asp Lys Lys Arg Gly Val Lys Arg Pro
180 185 190

Arg Glu Asp His Gly Arg Gly Tyr Phe Glu Tyr Ile Glu Glu Asn Lys
195 200 205

Tyr Ser Arg Ala Lys Ser Pro Gln Pro Pro Val Glu Glu Glu Asp Glu
210 215 220

His Phe Asp Asp Thr Val Val Cys Leu Asp Thr Tyr Asn Cys Asp Leu
225 230 235 240

His Phe Lys Ile Ser Arg Asp Arg Leu Ser Ala Ser Ser Leu Thr Met
245 250 255

Glu Ser Phe Ala Phe Leu Trp Ala Gly Gly Arg Ala Ser Tyr Gly Val
260 265 270

Ser Lys Gly Lys Val Cys Phe Glu Met Lys Val Thr Glu Lys Ile Pro
275 280 285

Val Arg His Leu Tyr Thr Lys Asp Ile Asp Ile His Glu Val Arg Ile
290 295 300

Gly Trp Ser Leu Thr Thr Ser Gly Met Leu Leu Gly Glu Glu Glu Phe
305 310 315 320

Ser Tyr Gly Tyr Ser Leu Lys Gly Ile Lys Thr Cys Asn Cys Glu Thr
325 330 335

Glu Asp Tyr Gly Glu Lys Phe Asp Glu Asn Asp Val Ile Thr Cys Phe
340 345 350

Ala Asn Phe Glu Ser Asp Glu Val Glu Leu Ser Tyr Ala Lys Asn Gly
355 360 365

Gln Asp Leu Gly Val Ala Phe Lys Ile Ser Lys Glu Val Leu Ala Gly
370 375 380



Arg Pro Leu Phe Pro His Val Leu Cys His Asn Cys Ala Val Glu Phe
385 390 395 400

Asn Phe Gly Gln Lys Glu Lys Pro Tyr Phe Pro Ile Pro Glu Glu Tyr
405 410 415

Thr Phe Ile Gln Asn Val Pro Leu Glu Asp Arg Val Arg Gly Pro Lys
420 425 430

Gly Pro Glu Glu Lys Lys Asp Cys Glu Val Val Met Met Ile Gly Leu
435 440 445

Pro Gly Ala Gly Lys Thr Thr Trp Val Thr Lys His Ala Ala Glu Asn
450 455 460

Pro Gly Lys Tyr Asn Ile Leu Gly Thr Asn Thr Ile Met Asp Lys Met
465 470 475 480

Met Val Ala Gly Phe Lys Lys Gln Met Ala Asp Thr Gly Lys Leu Asn
485 490 495

Thr Leu Leu Gln Arg Ala Pro Gln Cys Leu Gly Lys Phe Ile Glu Ile
500 505 510

Ala Ala Arg Lys Lys Arg Asn Phe Ile Leu Asp Gln Thr Asn Val Ser
515 520 525

Ala Ala Ala Gln Arg Arg Lys Met Cys Leu Phe Ala Gly Phe Gln Arg
530 535 540

Lys Ala Val Val Val Cys Pro Lys Asp Glu Asp Tyr Lys Gln Arg Thr
545 550 555 560

Gln Lys Lys Ala Glu Val Glu Gly Lys Asp Leu Pro Glu His Ala Val
565 570 575

Leu Lys Met Lys Gly Asn Phe Thr Leu Pro Glu Val Ala Glu Cys Phe
580 585 590

Asp Glu Ile Thr Tyr Val Glu Leu Gln Lys Glu Glu Ala Gln Lys Leu
595 600 605

Leu Glu Gln Tyr Lys Glu Glu Ser Lys Lys Ala Leu Pro Pro Glu Lys
610 615 620



Lys Gln Asn Thr Gly Ser Lys Lys Ser Asn Lys Asn Lys Ser Gly Lys
625 630 635 640

Asn Gln Phe Asn Arg Gly Gly Gly His Arg Gly Arg Gly Gly Phe Asn
645 650 655

Met Arg Gly Gly Asn Phe Arg Gly Gly Ala Pro Gly Asn Arg Gly Gly
660 665 670

Tyr Asn Arg Arg Gly Asn Met Pro Gln Arg Gly Gly Gly Gly Gly Gly
675 680 685

Ser Gly Gly Ile Gly Tyr Pro Tyr Pro Arg Ala Pro Val Phe Pro Gly
690 695 700

Arg Gly Ser Tyr Ser Asn Arg Gly Asn Tyr Asn Arg Gly Gly Met Pro
705 710 715 720

Asn Arg Gly Asn Tyr Asn Gln Asn Phe Arg Gly Arg Gly Asn Asn Arg
725 730 735

Gly Tyr Lys Asn Gln Ser Gln Gly Tyr Asn Gln Trp Gln Gln Gly Gln
740 745 750

Phe Trp Gly Gln Lys Pro Trp Ser Gln His Tyr His Gln Gly Tyr Tyr
755 760 765 



<210> 201 

<211> 793 

<212> PRT 

<213> Homo sapien 

<400> 201 

Ile Gly Ser Ala Leu Arg Gly Leu His Arg Ala Leu Gly Pro Gly Ser
1 5 10 15

Ala Trp Gly Arg Gly Gly Lys Tyr Gly Asn Gly Arg Gly Arg Ala Arg
20 25 30

Ala Arg Ser Gly Phe Pro Pro Ala Gln Cys Ser Arg Gln Arg Arg Gly
35 40 45

Arg Gly Ala Tyr Ile Thr Cys Ala Ala Pro Leu Leu Arg Glu Asp Val
50 55 60

Ala Cys Ser Leu Ala Pro Gly Glu Ser Pro Arg Leu Pro Arg Glu Leu
65 70 75 80



Gly Glu Arg Asn Trp Arg Arg Ser Thr Pro Gly Gly Ser Ala Leu Gln
85 90 95

His Glu Pro Ile Ser Ser Arg Ser Arg Ser Arg Arg Pro Gly Arg Gly
100 105 110

Ala Ala Ser Glu Asp Glu Asn Gly Asp Asp Gln Gly Phe Gln Glu Gly
115 120 125

Glu Asp Glu Leu Gly Asp Glu Glu Glu Gly Ala Gly Asp Glu Asn Gly
130 135 140

His Gly Glu Gln Gln Pro Gln Pro Pro Ala Thr Gln Gln Gln Gln Pro
145 150 155 160

Gln Gln Gln Arg Gly Ala Ala Lys Glu Ala Ala Gly Lys Ser Ser Gly
165 170 175

Pro Thr Ser Leu Phe Ala Val Thr Val Ala Pro Pro Gly Ala Arg Gln
180 185 190

Gly Gln Gln Gln Ala Gly Gly Asp Gly Lys Thr Glu Gln Lys Gly Gly
195 200 205

Asp Lys Lys Arg Gly Val Lys Arg Pro Arg Glu Asp His Gly Arg Gly
210 215 220

Tyr Phe Glu Tyr Ile Glu Glu Asn Lys Tyr Ser Arg Ala Lys Ser Pro
225 230 235 240

Gln Pro Pro Val Glu Glu Glu Asp Glu His Phe Asp Asp Thr Val Val
245 250 255

Cys Leu Asp Thr Tyr Asn Cys Asp Leu His Phe Lys Ile Ser Arg Asp
260 265 270

Arg Leu Ser Ala Ser Ser Leu Thr Met Glu Ser Phe Ala Phe Leu Trp
275 280 285

Ala Gly Gly Arg Ala Ser Tyr Gly Val Ser Lys Gly Lys Val Cys Phe
290 295 300

Glu Met Lys Val Thr Glu Lys Ile Pro Val Arg His Leu Tyr Thr Lys
305 310 315 320



Asp Ile Asp Ile His Glu Val Arg Ile Gly Trp Ser Leu Thr Thr Ser
325 330 335

Gly Met Leu Leu Gly Glu Glu Glu Phe Ser Tyr Gly Tyr Ser Leu Lys
340 345 350

Gly Ile Lys Thr Cys Asn Cys Glu Thr Glu Asp Tyr Gly Glu Lys Phe
355 360 365

Asp Glu Asn Asp Val Ile Thr Cys Phe Ala Asn Phe Glu Ser Asp Glu
370 375 380

Val Glu Leu Ser Tyr Ala Lys Asn Gly Gln Asp Leu Gly Val Ala Phe
385 390 395 400

Lys Ile Ser Lys Glu Val Leu Ala Gly Arg Pro Leu Phe Pro His Val
405 410 415

Leu Cys His Asn Cys Ala Val Glu Phe Asn Phe Gly Gln Lys Glu Lys
420 425 430

Pro Tyr Phe Pro Ile Pro Glu Glu Tyr Thr Phe Ile Gln Asn Val Pro
435 440 445

Leu Glu Asp Arg Val Arg Gly Pro Lys Gly Pro Glu Glu Lys Lys Asp
450 455 460

Cys Glu Val Val Met Met Ile Gly Leu Pro Gly Ala Gly Lys Thr Thr
465 470 475 480

Trp Val Thr Lys His Ala Ala Glu Asn Pro Gly Lys Tyr Asn Ile Leu
485 490 495

Gly Thr Asn Thr Ile Met Asp Lys Met Met Val Ala Gly Phe Lys Lys
500 505 510

Gln Met Ala Asp Thr Gly Lys Leu Asn Thr Leu Leu Gln Arg Ala Pro
515 520 525

Gln Cys Leu Gly Lys Phe Ile Glu Ile Ala Ala Arg Lys Lys Arg Asn
530 535 540

Phe Ile Leu Asp Gln Thr Asn Val Ser Ala Ala Ala Gln Arg Arg Lys
545 550 555 560



Met Cys Leu Phe Ala Gly Phe Gln Arg Lys Ala Val Val Val Cys Pro
565 570 575

Lys Asp Glu Asp Tyr Lys Gln Arg Thr Gln Lys Lys Ala Glu Val Glu
580 585 590

Gly Lys Asp Leu Pro Glu His Ala Val Leu Lys Met Lys Gly Asn Phe
595 600 605

Thr Leu Pro Glu Val Ala Glu Cys Phe Asp Glu Ile Thr Tyr Val Glu
610 615 620

Leu Gln Lys Glu Glu Ala Gln Lys Leu Leu Glu Gln Tyr Lys Glu Glu
625 630 635 640

Ser Lys Lys Ala Leu Pro Pro Glu Lys Lys Gln Asn Thr Gly Ser Lys
645 650 655

Lys Ser Asn Lys Asn Lys Ser Gly Lys Asn Gln Phe Asn Arg Gly Gly
660 665 670

Gly His Arg Gly Arg Gly Gly Phe Asn Met Arg Gly Gly Asn Phe Arg
675 680 685

Gly Gly Ala Pro Gly Asn Arg Gly Gly Tyr Asn Arg Arg Gly Asn Met
690 695 700

Pro Gln Arg Gly Gly Gly Gly Gly Gly Ser Gly Gly Ile Gly Tyr Pro
705 710 715 720

Tyr Pro Arg Ala Pro Val Phe Pro Gly Arg Gly Ser Tyr Ser Asn Arg
725 730 735

Gly Asn Tyr Asn Arg Gly Gly Met Pro Asn Arg Gly Asn Tyr Asn Gln
740 745 750

Asn Phe Arg Gly Arg Gly Asn Asn Arg Gly Tyr Lys Asn Gln Ser Gln
755 760 765

Gly Tyr Asn Gln Trp Gln Gln Gly Gln Phe Trp Gly Gln Lys Pro Trp
770 775 780

Ser Gln His Tyr His Gln Gly Tyr Tyr
785 790



<210> 202 

<211> 325 

<212> PRT 

<213> Homo sapien 

<400> 202 



Met Ile Phe Arg Asp Phe Ile Phe Ala Gln Val Ile Ser Lys Tyr Glu
1 5 10 15

Asp Ile Phe Phe Val Phe Phe Gln Val Gly Cys Leu Lys Phe Phe Pro
20 25 30

Phe Phe Ile Pro Leu Pro Lys Ala Asn Glu Lys Lys Val Asp Gln Pro
35 40 45

Pro Glu Ala Lys Lys Pro Lys Ile Lys Val Val Asn Val Glu Leu Pro
50 55 60

Ile Glu Ala Asn Leu Val Trp Gln Leu Gly Lys Asp Leu Leu Asn Met
65 70 75 80

Tyr Ile Glu Thr Glu Gly Lys Met Ile Met Gln Asp Lys Leu Glu Lys
85 90 95

Glu Arg Asn Asp Ala Lys Asn Ala Val Glu Glu Tyr Val Tyr Glu Phe
100 105 110

Arg Asp Lys Leu Cys Gly Pro Tyr Glu Lys Phe Ile Cys Glu Gln Asp
115 120 125

His Gln Asn Phe Leu Arg Leu Leu Thr Glu Thr Glu Asp Trp Leu Tyr
130 135 140

Glu Glu Gly Glu Asp Gln Ala Lys Gln Ala Tyr Val Asp Lys Leu Glu
145 150 155 160

Glu Leu Met Lys Ile Gly Thr Pro Val Lys Val Arg Phe Gln Glu Ala
165 170 175

Glu Glu Arg Pro Lys Met Phe Glu Glu Leu Gly Gln Arg Leu Gln His
180 185 190

Tyr Ala Lys Ile Ala Ala Asp Phe Arg Asn Lys Asp Glu Lys Tyr Asn
195 200 205

His Ile Asp Glu Ser Glu Met Lys Lys Val Glu Lys Ser Val Asn Glu
210 215 220



Val Met Glu Trp Met Asn Asn Val Met Asn Ala Gln Ala Lys Lys Ser
225 230 235 240

Leu Asp Gln Asp Pro Val Val Arg Ala Gln Glu Ile Lys Thr Lys Ile
245 250 255

Lys Glu Leu Asn Asn Thr Cys Glu Pro Val Val Thr Gln Pro Lys Pro
260 265 270

Lys Ile Glu Ser Pro Lys Leu Glu Arg Thr Pro Asn Gly Pro Asn Ile
275 280 285

Asp Lys Lys Glu Glu Asp Leu Glu Asp Lys Asn Asn Phe Gly Ala Glu
290 295 300

Pro Pro His Gln Asn Gly Glu Cys Tyr Pro Asn Glu Lys Asn Ser Val
305 310 315 320



Asn Met Asp Leu Asp 
325 



<210> 203 

<211> 131 

<212> PRT 

<213> Homo sapien 

<400> 203 

Met Lys Val Gln Asn Arg Arg Asn Ala His Leu Ser Ser Phe Asp Arg
1 5 10 15

Phe Ala Phe Leu Trp Glu Met Gln Phe Arg Val Thr Cys Cys Arg Arg
20 25 30

Tyr Val Val Asp His Leu Asp Ile Ile Ala Glu Asp Glu Leu Leu Asp
35 40 45

Gln Leu Leu Ser His Thr Ala Ser Leu Ser Asp Leu Leu Gln Asn Ser
50 55 60

Val Val Ala Thr Ala Gln Thr Arg Leu Arg Ser Tyr Ser Cys Ala Ser
65 70 75 80

Leu Arg Phe Ser Ser Ala Thr Met Ser Asp Lys Pro Asp Met Ala Glu
85 90 95

Ile Glu Lys Phe Asp Lys Ser Lys Leu Lys Lys Thr Glu Thr Gln Glu
100 105 110

Lys Asn Pro Leu Pro Ser Lys Glu Thr Ile Glu Gln Glu Lys Gln Ala
115 120 125



Gly Glu Ser 
130 



<210> 204 

<211> 116 

<212> PRT 

<213> Homo sapien 

<400> 204 

Ser Leu Arg Leu Pro Trp Glu Met Gln Phe Arg Val Thr Cys Cys Arg
1 5 10 15

Arg Tyr Val Val Asp His Leu Asp Ile Ile Ala Glu Asp Glu Leu Leu
20 25 30

Asp Gln Leu Leu Ser His Thr Ala Ser Leu Ser Asp Leu Leu Gln Asn
35 40 45

Ser Val Val Ala Thr Ala Gln Thr Arg Leu Arg Ser Tyr Ser Cys Ala
50 55 60

Ser Leu Arg Phe Ser Ser Ala Thr Met Ser Asp Lys Pro Asp Met Ala
65 70 75 80

Glu Ile Glu Lys Phe Asp Lys Ser Lys Leu Lys Lys Thr Glu Thr Gln
85 90 95

Glu Lys Asn Pro Leu Pro Ser Lys Glu Thr Ile Glu Gln Glu Lys Gln
100 105 110

Ala Gly Glu Ser
115

<210> 205

<211> 63

<212> PRT

<213> Homo sapien

<400> 205

Met Cys Leu Gly Val Cys Ser Phe Arg Cys Ser Asp Met Ser Arg Val
1 5 10 15

Ser Ser Phe Gln Trp Val Arg Gly Leu Thr Asp Phe Arg Asn Glu Ala
20 25 30

Ala Tyr Ala His Ala Pro Gln Lys Ser Ser Pro Ser Pro His Ser Thr
35 40 45

Gln Glu Val Leu Leu Ser Ser Pro Val Thr Ser Tyr Trp Gly Gly
50 55 60 



<210> 206 

<211> 97 

<212> PRT 

<213> Homo sapien 

<400> 206 

Cys Val Ser Glu Phe Val Pro Ser Asp Val Gln Ile Cys Pro Glu Phe
1 5 10 15

Leu Pro Ser Ser Gly Phe Val Val Ser Leu Thr Ser Gly Met Lys Leu
20 25 30

His Thr Leu Thr Leu His Arg Lys Val Leu Gln Val Pro Thr Arg Pro
35 40 45

Arg Lys Ser Cys Cys Leu His Leu Ser Pro Ala Thr Gly Glu Ala Glu
50 55 60

Ala Gly Glu Ser Leu Glu Pro Arg Arg Trp Arg Leu Leu Gly Ala Lys
65 70 75 80

Pro Arg Ser Arg His Tyr Thr Pro Ala Trp Gln Gln Ser Glu Thr Pro
85 90 95



Ser 



<210> 207 

<211> 163 

<212> PRT 

<213> Homo sapien 

<400> 207 



Met Asn Gly Glu Tyr Tyr Gln Glu Ser Asn Gly Pro Thr Asp Ser Tyr
1 5 10 15

Ala Ala Ile Ser Gln Val Asp Arg Leu Gln Ser Glu Pro Glu Ser Ile
20 25 30

Arg Lys Trp Arg Glu Glu Gln Met Glu Arg Leu Glu Ala Leu Asp Ala
35 40 45

Asn Ser Arg Lys Gln Glu Ala Glu Trp Lys Glu Lys Ala Ile Lys Glu
50 55 60

Leu Glu Glu Trp Tyr Ala Arg Gln Asp Glu Gln Leu Gln Lys Thr Lys
65 70 75 80

Ala Asn Asn Arg Ala Ala Glu Glu Ala Phe Val Asn Asp Ile Asp Glu
85 90 95

Ser Ser Pro Gly Thr Glu Trp Glu Arg Val Ala Arg Leu Cys Asp Phe
100 105 110

Asn Pro Lys Ser Ser Lys Gln Ala Lys Asp Val Phe Pro Pro Cys Ala
115 120 125

Gln Ser Ser Ser Pro Ser Ser Arg Pro Leu Val Leu Glu Ala Pro Val
130 135 140



Asp Leu Leu Val Leu Pro Arg Gly Leu Arg Trp Val Leu Gly Trp Pro 
145 150 155 160 



Leu Leu Ala 



<210> 208 

<211> 283 

<212> PRT 

<213> Homo sapien 

<400> 208 

Arg Leu Gly Thr Arg Leu Gly Val Ala Pro Leu Pro Val Gly Thr Thr
1 5 10 15

Ala Val Ala Ala Gly Arg Gly Val Gly Gly Ser Val Gly Phe Cys Leu
20 25 30

Thr Val Gly Val Arg Ala Val Gln Leu Pro Ala Met Ala Glu Leu Asp
35 40 45

Pro Phe Gly Ala Pro Ala Gly Ala Pro Gly Gly Pro Ala Leu Gly Asn
50 55 60

Gly Val Ala Gly Ala Gly Glu Glu Asp Pro Ala Ala Ala Phe Leu Ala
65 70 75 80

Gln Gln Glu Ser Glu Ile Ala Gly Ile Glu Asn Asp Glu Ala Phe Ala
85 90 95

Ile Leu Asp Gly Gly Ala Pro Gly Pro Gln Pro His Gly Glu Pro Pro
100 105 110

Gly Gly Pro Asp Ala Val Asp Gly Val Met Asn Gly Glu Tyr Tyr Gln
115 120 125

Glu Ser Asn Gly Pro Thr Asp Ser Tyr Ala Ala Ile Ser Gln Val Asp
130 135 140

Arg Leu Gln Ser Glu Pro Glu Ser Ile Arg Lys Trp Arg Glu Glu Gln
145 150 155 160

Met Glu Arg Leu Glu Ala Leu Asp Ala Asn Ser Arg Lys Gln Glu Ala
165 170 175

Glu Trp Lys Glu Lys Ala Ile Lys Glu Leu Glu Glu Trp Tyr Ala Arg
180 185 190

Gln Asp Glu Gln Leu Gln Lys Thr Lys Ala Asn Asn Arg Ala Ala Glu
195 200 205

Glu Ala Phe Val Asn Asp Ile Asp Glu Ser Ser Pro Gly Thr Glu Trp
210 215 220

Glu Arg Val Ala Arg Leu Cys Asp Phe Asn Pro Lys Ser Ser Lys Gln
225 230 235 240

Ala Lys Asp Val Ser Arg Met Arg Ser Val Leu Ile Ser Leu Lys Gln
245 250 255

Ala Ala Gly Ala Gly Gly Pro Val Asp Leu Leu Val Leu Pro Arg Gly
260 265 270

Leu Arg Trp Val Leu Gly Trp Pro Leu Leu Ala
275 280

<210> 209

<211> 222

<212> PRT

<213> Homo sapien 

<400> 209 

Met Ala Glu Leu Asp Pro Phe Gly Ala Pro Ala Gly Ala Pro Gly Gly 
1 5 10 15

Pro Ala Leu Gly Asn Gly Val Ala Gly Ala Gly Glu Glu Asp Pro Ala
20 25 30

Ala Ala Phe Leu Ala Gln Gln Glu Ser Glu Ile Ala Gly Ile Glu Asn
35 40 45

Asp Glu Ala Phe Ala Ile Leu Asp Gly Ala Ala Pro Pro Gly Pro Ser
50 55 60

Arg Thr Ala Ser Arg Arg Gly Val Arg Met Gln Glu Ser Asn Gly Pro
65 70 75 80

Thr Asp Ser Tyr Ala Ala Ile Ser Gln Val Asp Arg Leu Gln Ser Glu
85 90 95

Pro Glu Ser Ile Arg Lys Trp Arg Glu Glu Gln Met Glu Arg Leu Glu
100 105 110

Ala Leu Asp Ala Asn Ser Arg Lys Gln Glu Ala Glu Trp Lys Glu Lys
115 120 125

Ala Ile Lys Glu Leu Glu Glu Trp Tyr Ala Arg Gln Asp Glu Gln Leu
130 135 140

Gln Lys Thr Lys Ala Asn Asn Arg Ala Ala Glu Glu Ala Phe Val Asn
145 150 155 160

Asp Ile Asp Glu Ser Ser Pro Gly Thr Glu Trp Glu Arg Val Ala Arg
165 170 175

Leu Cys Asp Phe Asn Pro Lys Ser Ser Lys Gln Ala Lys Asp Val Phe
180 185 190

Pro Pro Cys Ala Gln Ser Ser Ser Pro Ser Ser Arg Pro Arg Trp Cys
195 200 205

Thr Glu Glu Pro Pro Cys Gly Asn Thr Thr Ser Ala Ile Ser
210 215 220

<210> 210

<211> 250 

<212> PRT 

<213> Homo sapien 

<400> 210 

Asn Arg Thr Arg Gly Ala Leu Val Leu Leu Ser Gln Ser Ala Pro Gln
1 5 10 15

Arg Trp Leu Pro Gly Val Val Ser Val Gly Arg Leu Val Phe Val Ser
20 25 30

Pro Leu Val Ser Val Pro Phe Ser Cys Pro Pro Trp Leu Ser Trp Ile
35 40 45

Arg Ser Ala Pro Leu Pro Ala Pro Leu Ala Val Pro Arg Trp Gly Thr
50 55 60

Glu Trp Pro Ala Pro Ala Lys Lys Thr Arg Leu Arg Pro Ser Trp Arg
65 70 75 80

Ser Lys Arg Ala Arg Leu Arg Ala Ser Arg Thr Thr Arg Pro Ser Pro
85 90 95

Ser Trp Thr Ala Ala Pro Pro Gly Pro Ser Arg Thr Ala Ser Arg Arg
100 105 110

Gly Val Arg Met Gln Glu Ser Asn Gly Pro Thr Asp Ser Tyr Ala Ala
115 120 125

Ile Ser Gln Val Asp Arg Leu Gln Ser Glu Pro Glu Ser Ile Arg Lys
130 135 140

Trp Arg Glu Glu Gln Met Glu Arg Leu Glu Ala Leu Asp Ala Asn Ser
145 150 155 160

Arg Lys Gln Glu Ala Glu Trp Lys Glu Lys Ala Ile Lys Glu Leu Glu
165 170 175

Glu Trp Tyr Ala Arg Gln Asp Glu Gln Leu Gln Lys Thr Lys Ala Asn
180 185 190

Asn Arg Ala Ala Glu Glu Ala Phe Val Asn Asp Ile Asp Glu Ser Ser
195 200 205

Pro Gly Thr Glu Trp Glu Arg Val Ala Arg Leu Cys Asp Phe Asn Pro
210 215 220

Lys Ser Ser Lys Gln Ala Lys Asp Val Ser Arg Met Arg Ser Val Leu
225 230 235 240

Ile Ser Leu Lys Gln Ala Pro Leu Val His
245 250 



<210> 211 

<211> 142 

<212> PRT 

<213> Homo sapien 

<400> 211 

Ser Ser Arg Ser Arg Ala Ala Ala Arg Asp Leu Glu Pro Asp Ala Trp 
1 5 10 15

Gly Ala Cys Pro Pro Leu Pro Val Gly Thr Thr Ala Val Ala Ala Gly
20 25 30

Arg Gly Val Gly Gly Ser Val Gly Phe Cys Leu Thr Val Gly Val Arg
35 40 45

Ala Val Gln Leu Pro Ala Met Ala Glu Leu Asp Pro Phe Gly Ala Pro
50 55 60

Ala Gly Ala Pro Gly Gly Pro Ala Leu Gly Asn Gly Val Ala Gly Ala
65 70 75 80

Gly Glu Glu Asp Pro Ala Ala Ala Phe Leu Ala Gln Gln Glu Ser Glu
85 90 95

Ile Ala Gly Ile Glu Asn Asp Glu Ala Phe Ala Ile Leu Asp Gly Ala
100 105 110

Ala Pro Pro Gly Pro Ser Arg Thr Ala Ser Arg Arg Gly Val Arg Met
115 120 125

Pro Ile Leu Gly Ser Lys Lys Gln Ser Gly Lys Lys Arg Gln
130 135 140 



<210> 212 

<211> 220 

<212> PRT 

<213> Homo sapien 

<400> 212 



Ser Ser Arg Ser Arg Ala Ala Ala Arg Asp Leu Glu Pro Asp Ala Trp
1 5 10 15

Gly Ala Cys Pro Pro Leu Pro Val Gly Thr Thr Ala Val Ala Ala Gly
20 25 30

Arg Gly Val Gly Gly Ser Val Gly Phe Cys Leu Thr Val Gly Val Arg
35 40 45

Ala Val Gln Leu Pro Ala Met Ala Glu Leu Asp Pro Phe Gly Ala Pro
50 55 60

Ala Gly Ala Pro Gly Gly Pro Ala Leu Gly Asn Gly Val Ala Gly Ala
65 70 75 80

Gly Glu Glu Asp Pro Ala Ala Ala Phe Leu Ala Gln Gln Glu Ser Glu
85 90 95

Ile Ala Gly Ile Glu Asn Asp Glu Ala Phe Ala Ile Leu Asp Gly Gly
100 105 110

Ala Pro Gly Pro Gln Pro His Gly Glu Pro Pro Gly Gly Pro Asp Ala
115 120 125

Asn Ser Arg Lys Gln Glu Ala Glu Trp Lys Glu Lys Ala Ile Lys Glu
130 135 140

Leu Glu Glu Trp Tyr Ala Arg Gln Asp Glu Gln Leu Gln Lys Thr Lys
145 150 155 160

Ala Asn Asn Arg Ala Ala Glu Glu Ala Phe Val Asn Asp Ile Asp Glu
165 170 175

Ser Ser Pro Gly Thr Glu Trp Glu Arg Val Ala Arg Leu Cys Asp Phe
180 185 190

Asn Pro Lys Ser Ser Lys Gln Ala Lys Asp Val Ser Arg Met Arg Ser
195 200 205

Val Leu Ile Ser Leu Lys Gln Ala Pro Leu Val His
210 215 220

<210> 213

<211> 111

<212> PRT

<213> Homo sapien

<400> 213

Met Asn Gly Glu Tyr Tyr Gln Glu Ser Asn Gly Pro Thr Asp Ser Tyr
1 5 10 15



Ala Ala Ile Ser Gln Val Asp Arg Leu Gln Ser Glu Pro Glu Ser Ile
20 25 30

Arg Lys Trp Arg Glu Glu Gln Met Glu Arg Leu Glu Ala Leu Asp Ala
35 40 45

Asn Ser Arg Lys Gln Glu Ala Glu Trp Lys Glu Lys Ala Ile Lys Glu
50 55 60

Leu Glu Glu Trp Tyr Ala Arg Gln Asp Glu Gln Leu Gln Lys Thr Lys
65 70 75 80

Ala Asn Asn Arg Val Ala Asp Glu Ala Phe Tyr Lys Gln Pro Phe Ala
85 90 95

Asp Val Ile Gly Tyr Val Gln Gly Ser Arg Arg Ser Leu Cys Lys
100 105 110 



<210> 214 

<211> 236 

<212> PRT 

<213> Homo sapien 

<400> 214 

Gly Phe Arg Phe Thr Arg Leu Pro Pro Gly Ala Cys Pro Pro Leu Pro 
1 5 10 15 



Val Gly Thr Thr Ala Val Ala Ala Gly Arg Gly Val Gly Gly Ser Val 
20 25 30

Gly Phe Cys Leu Thr Val Gly Val Arg Ala Val Gln Leu Pro Ala Met
35 40 45

Ala Glu Leu Asp Pro Phe Gly Ala Pro Ala Gly Ala Pro Gly Gly Pro
50 55 60

Ala Leu Gly Asn Gly Val Ala Gly Ala Gly Glu Glu Asp Pro Ala Ala
65 70 75 80

Ala Phe Leu Ala Gln Gln Glu Ser Glu Ile Ala Gly Ile Glu Asn Asp
85 90 95

Glu Ala Phe Ala Ile Leu Asp Gly Gly Ala Pro Gly Pro Gln Pro His
100 105 110

Gly Glu Pro Pro Gly Gly Pro Asp Ala Val Asp Gly Val Met Asn Gly
115 120 125

Glu Tyr Tyr Gln Glu Ser Asn Gly Pro Thr Asp Ser Tyr Ala Ala Ile
130 135 140

Ser Gln Val Asp Arg Leu Gln Ser Glu Pro Glu Ser Ile Arg Lys Trp
145 150 155 160

Arg Glu Glu Gln Met Glu Arg Leu Glu Ala Leu Asp Ala Asn Ser Arg
165 170 175

Lys Gln Glu Ala Glu Trp Lys Glu Lys Ala Ile Lys Glu Leu Glu Glu
180 185 190

Trp Tyr Ala Arg Gln Asp Glu Gln Leu Gln Lys Thr Lys Ala Asn Asn
195 200 205

Arg Val Ala Asp Glu Ala Phe Tyr Lys Gln Pro Phe Ala Asp Val Ile
210 215 220

Gly Tyr Val Gln Gly Ser Arg Arg Ser Leu Cys Lys
225 230 235

<210> 215

<211> 211

<212> PRT

<213> Homo sapien

<400> 215



Met Asn Gly Glu Tyr Tyr Gln Glu Ser Asn Gly Pro Thr Asp Ser Tyr
1 5 10 15

Ala Ala Ile Ser Gln Val Asp Arg Leu Gln Ser Glu Pro Glu Ser Ile
20 25 30

Arg Lys Trp Arg Glu Glu Gln Met Glu Arg Leu Glu Ala Leu Asp Ala
35 40 45

Asn Ser Arg Lys Gln Glu Ala Glu Trp Lys Glu Lys Ala Ile Lys Glu
50 55 60

Leu Glu Glu Trp Tyr Ala Arg Gln Asp Glu Gln Leu Gln Lys Thr Lys
65 70 75 80

Ala Asn Asn Arg Ala Ala Glu Glu Ala Phe Val Asn Asp Ile Asp Glu
85 90 95

Ser Ser Pro Gly Thr Glu Trp Glu Arg Val Ala Arg Leu Cys Asp Phe
100 105 110

Asn Pro Lys Ser Ser Lys Gln Ala Lys Asp Val Phe Pro Pro Cys Ala
115 120 125

Gln Ser Ser Ser Pro Ser Ser Arg Pro Val Gly Asp Glu Glu His Leu
130 135 140

Trp Lys Thr Asn Leu Ile Phe Ile Tyr Gln Gly Ser Leu Ser Met Ile
145 150 155 160

Tyr Val Glu Val Leu Glu Asn Leu Cys Ser Ser Val Leu Leu Val Gly
165 170 175

Phe Ser Lys Gly Ser Gly Lys Arg Gly Arg Lys Gly Ala Lys Arg Trp
180 185 190

Phe Lys Phe Gly Arg Lys Lys Lys Gly Leu Gly Cys Phe Val Gly Leu
195 200 205

Lys Phe Ile
210

<210> 216

<211> 263

<212> PRT

<213> Homo sapien

<220>

<221> MISC_FEATURE

<222> (263)..(263)

<223> X=any amino acid

<400> 216



Gly Phe Arg Phe Thr Arg Leu Pro Pro Gly Ala Cys Pro Pro Leu Pro 
1 5 10 15

Val Gly Thr Thr Ala Val Ala Ala Gly Arg Gly Val Gly Gly Ser Val
20 25 30

Gly Phe Cys Leu Thr Val Gly Val Arg Ala Val Gln Leu Pro Ala Met
35 40 45

Ala Glu Leu Asp Pro Phe Gly Ala Pro Ala Gly Ala Pro Gly Gly Pro
50 55 60

Ala Leu Gly Asn Gly Val Ala Gly Ala Gly Glu Glu Asp Pro Ala Ala
65 70 75 80

Ala Phe Leu Ala Gln Gln Glu Ser Glu Ile Ala Gly Ile Glu Asn Asp
85 90 95

Glu Ala Phe Ala Ile Leu Asp Gly Gly Ala Pro Gly Pro Gln Pro His
100 105 110

Gly Glu Pro Pro Gly Gly Pro Asp Ala Val Asp Gly Val Met Asn Gly
115 120 125

Glu Tyr Tyr Gln Glu Ser Asn Gly Pro Thr Asp Ser Tyr Ala Ala Ile
130 135 140

Ser Gln Val Asp Arg Leu Gln Ser Glu Pro Glu Ser Ile Arg Lys Trp
145 150 155 160

Arg Glu Glu Gln Met Glu Arg Leu Glu Ala Leu Asp Ala Asn Ser Arg
165 170 175

Lys Gln Glu Ala Glu Trp Lys Glu Lys Ala Ile Lys Glu Leu Glu Glu
180 185 190

Trp Tyr Ala Arg Gln Asp Glu Gln Leu Gln Lys Thr Lys Ala Asn Asn
195 200 205

Arg Ala Ala Glu Glu Ala Phe Val Asn Asp Ile Asp Glu Ser Ser Pro
210 215 220

Gly Thr Glu Trp Glu Arg Val Ala Arg Leu Cys Asp Phe Asn Pro Lys
225 230 235 240

Ser Ser Lys Gln Ala Lys Asp Val Ser Arg Met Arg Ser Val Leu Ile
245 250 255

Ser Leu Lys Gln Ala Arg Xaa
260

<210> 217

<211> 150

<212> PRT

<213> Homo sapien

<400> 217

Met Asn Gly Glu Tyr Tyr Gln Glu Ser Asn Gly Pro Thr Asp Ser Tyr
1 5 10 15

Ala Ala Ile Ser Gln Val Asp Arg Leu Gln Ser Glu Pro Glu Ser Ile
20 25 30

Arg Lys Trp Arg Glu Glu Gln Met Glu Arg Leu Glu Ala Leu Asp Ala
35 40 45

Asn Ser Arg Lys Gln Glu Ala Glu Trp Lys Glu Lys Ala Ile Lys Glu
50 55 60

Leu Glu Glu Trp Tyr Ala Arg Gln Asp Glu Gln Leu Gln Lys Thr Lys
65 70 75 80

Ala Asn Asn Arg Ala Ala Glu Glu Ala Phe Val Asn Asp Ile Asp Glu
85 90 95

Ser Ser Pro Gly Thr Glu Trp Glu Arg Val Ala Arg Leu Cys Asp Phe
100 105 110

Asn Pro Lys Ser Ser Lys Arg Ala Arg Cys Phe Pro Met Ala Gln Thr
115 120 125

His Ser Ser Arg Arg Pro Val Cys Leu Arg Ala Pro Val Glu Pro Thr
130 135 140

Leu Ile Phe Ile Tyr Gln
145 150

<210> 218

<211> 269

<212> PRT

<213> Homo sapien

<400> 218



Gly Phe Arg Phe Thr Arg Leu Pro Pro Gly Ala Cys Pro Pro Leu Pro 
1 5 10 15

Val Gly Thr Thr Ala Val Ala Ala Gly Arg Gly Val Gly Gly Ser Val
20 25 30

Gly Phe Cys Leu Thr Val Gly Val Arg Ala Val Gln Leu Pro Ala Met
35 40 45

Ala Glu Leu Asp Pro Phe Gly Ala Pro Ala Gly Ala Pro Gly Gly Pro
50 55 60

Ala Leu Gly Asn Gly Val Ala Gly Ala Gly Glu Glu Asp Pro Ala Ala
65 70 75 80

Ala Phe Leu Ala Gln Gln Glu Ser Glu Ile Ala Gly Ile Glu Asn Asp
85 90 95

Glu Ala Phe Ala Ile Leu Asp Gly Gly Ala Pro Gly Pro Gln Pro His
100 105 110

Gly Glu Pro Pro Gly Gly Pro Asp Ala Val Asp Gly Val Met Asn Gly
115 120 125

Glu Tyr Tyr Gln Glu Ser Asn Gly Pro Thr Asp Ser Tyr Ala Ala Ile
130 135 140

Ser Gln Val Asp Arg Leu Gln Ser Glu Pro Glu Ser Ile Arg Lys Trp
145 150 155 160

Arg Glu Glu Gln Met Glu Arg Leu Glu Ala Leu Asp Ala Asn Ser Arg
165 170 175

Lys Gln Glu Ala Glu Trp Lys Glu Lys Ala Ile Lys Glu Leu Glu Glu
180 185 190

Trp Tyr Ala Arg Gln Asp Glu Gln Leu Gln Lys Thr Lys Ala Asn Asn
195 200 205

Arg Ala Ala Glu Glu Ala Phe Val Asn Asp Ile Asp Glu Ser Ser Pro
210 215 220

Gly Thr Glu Trp Glu Arg Val Ala Arg Leu Cys Asp Phe Asn Pro Lys
225 230 235 240

Ser Ser Lys Arg Ala Arg Cys Phe Pro Met Ala Gln Ser His Ser Ser
245 250 255

Arg Arg Pro Val Cys Leu Arg Ala Leu Trp Asn Leu His
260 265

<210> 219

<211> 152

<212> PRT

<213> Homo sapien

<400> 219

Met Lys Ala Leu Ile Val Leu Gly Leu Val Leu Leu Ser Val Thr Val
1 5 10 15

Gln Gly Lys Val Phe Glu Arg Cys Glu Leu Ala Arg Thr Leu Lys Arg
20 25 30

Leu Gly Met Asp Gly Tyr Arg Gly Ile Ser Leu Ala Asn Trp Met Cys
35 40 45

Leu Ala Lys Trp Glu Ser Gly Tyr Asn Thr Arg Ala Thr Asn Tyr Asn
50 55 60

Ala Gly Asp Arg Ser Thr Asp Tyr Gly Ile Phe Gln Ile Asn Ser Arg
65 70 75 80

Tyr Trp Cys Asn Asp Gly Lys Thr Pro Gly Ala Val Asn Ala Cys His
85 90 95

Leu Ser Cys Ser Ala Leu Leu Gln Asp Asn Ile Ala Asp Ala Val Ala
100 105 110

Cys Ala Lys Arg Val Val Arg Asp Pro Gln Gly Ile Arg Ala Trp Val
115 120 125

Ala Trp Arg Asn Arg Cys Leu Ser Phe Phe Ser Ser Phe Cys Leu Ser
130 135 140

His Ile Lys Gly Val Gly Ile Lys
145 150

<210> 220

<211> 178

<212> PRT

<213> Homo sapien

<400> 220



Ser Glu Gly Gly Ala Trp Thr Pro Gly Ser Glu Pro Thr Gly Ala Gln
1 5 10 15

Ala Cys Glu Phe Glu Ala Pro Ser His His Pro Pro Pro Gln Val Val
20 25 30

Tyr Ile Ala Cys Ser Phe Thr Thr Val Trp Leu Ile Tyr Ser Lys Phe
35 40 45

Lys Ala Thr Tyr Asp Gly Asn His Asp Thr Phe Arg Val Glu Phe Leu
50 55 60

Val Val Pro Thr Ala Ile Leu Ala Phe Leu Val Asn His Asp Phe Thr
65 70 75 80

Pro Leu Glu Ile Leu Trp Thr Phe Ser Ile Tyr Leu Glu Ser Val Ala
85 90 95

Ile Leu Pro Gln Leu Phe Met Val Ser Lys Thr Gly Glu Ala Glu Thr
100 105 110

Ile Thr Ser His Tyr Leu Phe Ala Leu Gly Val Tyr Arg Thr Leu Tyr
115 120 125

Leu Phe Asn Trp Ile Trp Arg Tyr His Phe Glu Gly Phe Phe Asp Leu
130 135 140

Ile Ala Ile Val Ala Gly Leu Val Gln Thr Val Leu Tyr Cys Asp Phe
145 150 155 160

Phe Tyr Leu Tyr Ile Thr Lys Val Leu Lys Gly Lys Lys Leu Ser Leu
165 170 175



Pro Ala 



<210> 221 

<211> 183 

<212> PRT 

<213> Homo sapien 

<400> 221 

Leu His Pro Ser Gly Asp Pro Leu Asp Leu Leu His Leu Pro Gly Val 
15 10 15 



Ser Gly His Leu Ala Ala Ala Val His Gly Glu Gin Asp Arg Arg Gly 
20 25 30 



Gly Asp His His Gin Pro Leu Leu Val Cys Ala Arg Arg Leu Pro His 
35 40 45 



Ala Leu Ser Leu Gin Leu Asp Leu Ala Leu Pro Phe Arg Gly Leu Leu 
50 55 60 



Arg- Pro His Arg His Cys Gly Arg Pro Gly Pro Asp Ser Pro Leu Leu 
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65 70 75 



80 



Arg Phe Leu Leu Pro Leu Tyr His Gin Ser Pro Lys Gly Glu Glu Val 
85 90 95 

Glu Phe Ala Gly He Ala Pro Val Leu Ser He Ser Leu Leu Gly Ser 
100 105 no 



Ser Gly Arg Gin Arg Lys Ala Ala Glu Asp Glu Glu Leu Ser His Pro 
115 120 125 



Gly Val Thr Phe Leu Arg Thr His Leu Leu Cys Ser Pro Ser Arq Leu 
130 135 140 



Leu Pro Gly Phe Arg Gly Thr Val Glu Asp Pro Gly Leu Gly Glu Leu 
145 150 155 160

Arg Thr Trp Ala Val Cys Ser Phe Leu Pro Phe Arg Gin Glu Lys Lys 
165 170 175 



He Phe Pro Leu Phe Ser Phe 
180 



<210> 222 

<211> 742 

<212> PRT 

<213> Homo sapien 

<400> 222 

Met Ala Asp Ala Gly He Arg Arg Val Val Pro Ser Asp Leu Tyr Pro 
1 5 10 15 

Leu Val Leu Gly Phe Leu Arg Asp Asn Gin Leu Ser Glu Val Ala Asn 
20 25 30 

Lys Phe Ala Lys Ala Thr Gly Ala Thr Gin Gin Asp Ala Asn Ala Ser 
35 40 45 

Ser Leu Leu Asp He Tyr Ser Phe Trp Leu Asn Arg Ser Ala Lys Val 
50 55 60 

Pro Glu Arg Lys Leu Gln Ala Asn Gly Pro Val Ala Lys Lys Ala Lys
65 70 75 80



Lys Lys Ala Ser Ser Ser Asp Ser Glu Asp Ser Ser Glu Glu Glu Glu 
85 90 95 
Glu Val Gln Gly Pro Pro Ala Lys Lys Ala Ala Val Pro Ala Lys Arg
100 105 110 



Val Gly Leu Pro Pro Gly Lys Ala Ala Ala Lys Ala Ser Glu Ser Ser 
115 120 125 



Ser Ser Glu Glu Ser Ser Asp Asp Asp Asp Glu Glu Asp Gin Lys Lys 
130 135 140 

Gin Pro Val Gin Lys Gly Val Lys Pro Gin Ala Lys Ala Ala Lys Ala 
145 150 155 160 



Pro Pro Lys Lys Ala Lys Ser Ser Asp Ser Asp Ser Asp Ser Ser Ser 
165 170 175 



Glu Asp Glu Pro Pro Lys Asn Gin Lys Pro Lys lie Thr Pro Val Thr 
180 185 190 



Val Lys Ala Gin Thr Lys Ala Pro Pro Lys Pro Ala Arg Ala Ala Pro 
195 200 205 



Lys He Ala Asn Gly Lys Ala Ala Ser Ser Ser Ser Ser Ser Ser Ser 
210 215 220 



Ser Ser Ser Ser Asp Asp Ser Glu Glu Glu Lys Ala Ala Ala Thr Pro 
225 230 235 240 



Lys Lys Val Trp Thr He Thr Ser Val Arg Ala Glu Val Thr Arg Ala 
245 250 255 



Val Met Cys Val Cys Leu Pro Ser Leu Ala Gly Leu Val Gly Ser Ala 
260 265 270 



Phe Ser Cys Gly Glu Ser He Phe Phe Leu Met Gin Thr Val Pro Lys 
275 280 285 



Lys Gin Val Val Ala Lys Ala Pro Val Lys Ala Ala Thr Thr Pro Thr 
290 295 300 



Arg Lys Ser Ser Ser Ser Glu Asp Ser Ser Ser Asp Glu Glu Glu Glu 
305 310 315 320



Gin Lys Lys Pro Met Lys Asn Lys Pro Gly Pro Tyr Ser Ser Val Pro 
325 330 335 






Pro Pro Ser Ala Pro Pro Pro Lys Lys Ser Leu Gly Thr Gin Pro Pro 
340 345 350 



Lys Lys Ala Val Glu Lys Gin Gin Pro Val Glu Ser Ser Glu Asp Ser 
355 360 365 



Ser Asp Glu Ser Asp Ser Ser Ser Glu Glu Glu Lys Lys Pro Pro Thr 
370 375 380 



Lys Ala Val Val Ser Lys Ala Thr Thr Lys Pro Pro Pro Ala Lys Lys 

385 390 395 400 

Ala Ala Glu Ser Ser Ser Asp Ser Ser Asp Ser Asp Ser Ser Glu Asp 
405 410 415 



Asp Glu Ala Pro Ser Lys Pro Ala Gly Thr Thr Lys Asn Ser Ser Asn 
420 425 430 



Lys Pro Ala Val Thr Thr Lys Ser Pro Ala Val Lys Pro Ala Ala Ala 
435 440 445 



Pro Lys Gin Pro Val Gly Gly Gly Gin Lys Leu Leu Thr Arg Lys Ala 
450 455 460 



Asp Ser Ser Ser Ser Glu Glu Glu Ser Ser Ser Ser Glu Glu Glu Lys 

465 470 475 480 

Thr Lys Lys Met Val Ala Thr Thr Lys Pro Lys Ala Thr Ala Lys Ala 

485 490 495 



Ala Leu Ser Leu Pro Ala Lys Gln Ala Pro Gln Gly Ser Arg Asp Ser
500 505 510



Ser Ser Asp Ser Asp Ser Ser Ser Ser Glu Glu Glu Glu Glu Lys Thr 
515 520 525



Ser Lys Ser Ala Val Lys Lys Lys Pro Gin Lys Val Ala Gly Gly Ala 
530 535 540 



Ala Pro Ser Lys Pro Ala Ser Ala Lys Lys Gly Lys Ala Glu Ser Ser 
545 550 555 560 



Asn Ser Ser Ser Ser Asp Asp Ser Ser Glu Glu Glu Glu Glu Lys Leu 
565 570 575 



Lys Gly Lys Gly Ser Pro Arg Pro Gin Ala Pro Lys Ala Asn Gly Thr 
580 585 590



Ser Ala Leu Thr Ala Gin Asn Gly Lys Ala Ala Lys Asn Ser Glu Glu 
595 600 605 



Glu Glu Glu Glu Lys Lys Lys Ala Ala Val Val Val Ser Lys Ser Gly 
610 615 620 



Ser Leu Lys Lys Arg Lys Gin Asn Glu Ala Ala Lys Glu Ala Glu Thr 
625 630 635 640 



Pro Gin Ala Lys Lys lie Lys Leu Gin Thr Pro Asn Thr Phe Pro Lys 
645 650 655 



Arg Lys Lys Gly Glu Lys Arg Ala Ser Ser Pro Phe Arg Arg Val Arg 
660 665 670 



Glu Glu Glu lie Glu Val Asp Ser Arg Val Ala Asp Asn Ser Phe Asp 
675 680 685 



Ala Lys Arg Gly Ala Ala Gly Asp Trp Gly Glu Arg Ala Asn Gin Val 
690 695 700 



Leu Lys Phe Thr Lys Gly Lys Ser Phe Arg His Glu Lys Thr Lys Lys 
705 710 715 720 



Lys Arg Gly Ser Tyr Arg Gly Gly Ser lie Ser Val Gin Val Asn Ser 
725 730 735 



lie Lys Phe Asp Ser Glu 
740 



<210> 223 

<211> 288 

<212> PRT 

<213> Homo sapien 

<400> 223 

Leu Ser Gly His Leu Leu Leu Lys Gln Ser Gly Pro Thr Gln Pro Glu
1 5 10 15



Arg Thr Ala Phe Leu Ser Asn Pro Gin Pro Leu lie Phe Ala Asn Leu 
20 25 30 



Lys His Tyr Gin Lys Gin Gin Ser Leu Pro Ser Leu Cys Ser Thr Ser 
35 40 45 
Asp Pro Asp Thr Pro Leu Gly Ala Pro Ser Ser Ser Asp Ser Asp Ser
50 55 60 



Ser Ser Ser Glu Glu Glu Glu Glu Lys Thr Ser Lys Ser Ala Val Lys 
65 70 75 80 



Lys Lys Pro Gin Lys Val Ala Gly Gly Ala Ala Pro Ser Lys Pro Ala 
85 90 95 



Ser Ala Lys Lys Gly Lys Ala Glu Ser Ser Asn Ser Ser Ser Ser Asp 
100 105 110 



Asp Ser Ser Glu Glu Glu Glu Glu Lys Leu Lys Gly Lys Gly Ser Pro 
115 120 125 



Arg Pro Gin Ala Pro Lys Ala Asn Gly Thr Ser Ala Leu Thr Ala Gin 
130 135 140 



Asn Gly Lys Ala Ala Lys Asn Ser Glu Glu Glu Glu Glu Glu Lys Lys 
145 150 155 160 



Lys Ala Ala Val Val Val Ser Lys Ser Gly Ser Leu Lys Lys Arg Lys 
165 170 175 



Gin Asn Glu Ala Ala Lys Glu Ala Glu Thr Pro Gin Ala Lys Lys lie 
180 185 190 



Lys Leu Gin Thr Pro Asn Thr Phe Pro Lys Arg Lys Lys Gly Glu Lys 
195 200 205 



Arg Ala Ser Ser Pro Phe Arg Arg Val Arg Glu Glu Glu lie Glu Val 
210 215 220 



Asp Ser Arg Val Ala Asp Asn Ser Phe Asp Ala Lys Arg Gly Ala Ala 
225 230 235 240 



Gly Asp Trp Gly Glu Arg Ala Asn Gin Val Leu Lys Phe Thr Lys Gly 
245 250 255 



Lys Ser Phe Arg His Glu Lys Thr Lys Lys Lys Arg Gly Ser Tyr Arg 
260 265 270 



Gly Gly Ser Ile Ser Val Gln Val Asn Ser Ile Lys Phe Asp Ser Glu
275 280 285

<210> 224

<211> 265 

<212> PRT 

<213> Homo sapien 

<400> 224

Ser Ser Ala Pro Asp Phe Ala Asn Leu Lys His Tyr Gln Lys Gln Gln
1 5 10 15



Ser Leu Pro Ser Leu Cys Ser Thr Ser Asp Pro Asp Thr Pro Leu Gly 
20 25 30 



Ala Pro Ser Ser Ser Asp Ser Asp Ser Ser Ser Ser Glu Glu Glu Glu 
35 40 45 



Glu Lys Thr Ser Lys Ser Ala Val Lys Lys Lys Pro Gin Lys Val Ala 
50 55 60 



Gly Gly Ala Ala Pro Ser Lys Pro Ala Ser Ala Lys Lys Gly Lys Ala 
65 70 75 80 



Glu Ser Ser Asn Ser Ser Ser Ser Asp Asp Ser Ser Glu Glu Glu Glu 
85 90 95



Glu Lys Leu Lys Gly Lys Gly Ser Pro Arg Pro Gin Ala Pro Lys Ala 
100 105 110 



Asn Gly Thr Ser Ala Leu Thr Ala Gin Asn Gly Lys Ala Ala Lys Asn 
115 120 125 



Ser Glu Glu Glu Glu Glu Glu Lys Lys Lys Ala Ala Val Val Val Ser 
130 135 140 



Lys Ser Gly Ser Leu Lys Lys Arg Lys Gin Asn Glu Ala Ala Lys Glu 
145 150 155 160 



Ala Glu Thr Pro Gin Ala Lys Lys lie Lys Leu Gin Thr Pro Asn Thr 
165 170 175 



Phe Pro Lys Arg Lys Lys Gly Glu Lys Arg Ala Ser Ser Pro Phe Arg 
180 185 190 



Arg Val Arg Glu Glu Glu lie Glu Val Asp Ser Arg Val Ala Asp Asn 
195 200 205 



Ser Phe Asp Ala Lys Arg Gly Ala Ala Gly Asp Trp Gly Glu Arg Ala 
210 215 220 
Asn Gln Val Leu Lys Phe Thr Lys Gly Lys Ser Phe Arg His Glu Lys
225 230 235 240 



Thr Lys Lys Lys Arg Gly Ser Tyr Arg Gly Gly Ser He Ser Val Gin 
245 250 255 



Val Asn Ser He Lys Phe Asp Ser Glu 
260 265 



<210> 225 

<211> 1276 

<212> PRT 

<213> Homo sapien 

<400> 225 

Met Ile Leu Val Gln Ile Leu Lys Gln Glu Trp Pro Lys His Trp Pro
1 5 10 15



Thr Phe He Ser Asp He Val Gly Ala Ser Arg Thr Ser Glu Ser Leu 
20 25 30 



Cys Gin Asn Asn Met Val He Leu Lys Leu Leu Ser Glu Glu Val Phe 
35 40 45 



Asp Phe Ser Ser Gly Gin He Thr Gin Val Lys Ser Lys His Leu Lys 
50 55 60 



Asp Ser Met Cys Asn Glu Phe Ser Gin He Phe Gin Leu Cys Gin Phe 
65 70 75 80 



Val Met Glu Asn Ser Gin Asn Ala Pro Leu Val His Ala Thr Leu Glu 
85 90 95 



Thr Leu Leu Arg Phe Leu Asn Trp He Pro Leu Gly Tyr He Phe Glu 
100 105 110 



Thr Lys Leu He Ser Thr Leu He Tyr Lys Phe Leu Asn Val Pro Met 
115 120 125 



Phe Arg Asn Val Ser Leu Lys Cys Leu Thr Glu He Ala Gly Val Ser 
130 135 140 



Val Ser Gin Tyr Glu Glu Gin Phe Val Thr Leu Phe Thr Leu Thr Met 
145 150 155 160 
Met Gln Leu Lys Gln Met Leu Pro Leu Asn Thr Asn Ile Arg Leu Ala
165 170 175 



Tyr Ser Asn Gly Lys Asp Asp Glu Gin Asn Phe He Gin Asn Leu Ser 
180 185 190 



Leu Phe Leu Cys Thr Phe Leu Lys Glu His Asp Gin Leu He Glu Lys 
195 200 205 



Arg Leu Asn Leu Arg Glu Thr Leu Met Glu Ala Leu His Tyr Met Leu 
210 215 220 



Leu Val Ser Glu Val Glu Glu Thr Glu He Phe Lys He Cys Leu Glu 
225 230 235 240 



Tyr Trp Asn His Leu Ala Ala Glu Leu Tyr Arg Glu Ser Pro Phe Ser 
245 250 255



Thr Ser Ala Ser Pro Leu Leu Ser Gly Ser Gin His Phe Asp Val Pro 
260 265 270 



Pro Arg Arg Gin Leu Tyr Leu Pro Met Leu Phe Lys Val Thr Glu Arg 
275 280 285 



Leu Val Glu Cys Ser Ser Cys Cys He Leu Trp Phe Leu Arg Ser Glu 
290 295 300 



Ser Lys Tyr Phe Tyr He Cys Val Asn Lys Leu Ala He Lys Arg Glu 
305 310 315 320 



Pro Asn Asn Phe Ser Met Ser Val Glu Asn Gin Asn Met Lys Gly Val 
325 330 335 



Glu Ser Arg Thr Leu He Leu Lys Ser Val Val Leu Leu Ser Val Ser 
340 345 350 



Thr Leu Val Val He Ser Leu Gly Lys Phe He Ala Thr Cys Gin Ser 
355 360 365 



Thr Lys Pro Glu Ser Arg Asn Glu Thr Gin Glu Thr Pro Val Thr Glu 
370 375 380 



Val Gly Glu Lys Asn His He Lys Thr His Leu Asn Asn Tyr Lys Ala 
385 390 395 400 



Ile Phe Val Glu Leu Gln Trp Lys Lys Asn Phe Phe Phe Trp Arg Gln
405 410 415



Gly Leu Ala Leu Trp Leu Arg Leu Glu Cys Ser Gly Val Val He Ala 
420 425 430 



His Tyr Asn Leu Glu Leu Leu Asp Tyr Lys Gin Phe Ser Cys Val Ser 
435 440 445 



Leu Pro Ser Asn Trp Leu Gin Ala His Thr Thr Met Pro Glu Gin He 
450 455 460 



Leu Asn Phe Leu Val Leu Leu His Leu Tyr Leu His Ser Phe Phe Cys 
465 470 475 480 



Cys Ala Leu Ser Pro Ser Ile Phe Ile Val Leu Glu Ile Gly Cys His
485 490 495 



Phe Val Leu Leu Lys Leu Leu Ser Asn Ser Trp Pro Gin Gin Phe Phe 
500 505 510 



Leu Pro Arg Pro Ser Gly He Val Gly He He Gly Met Asn Pro Tyr 
515 520 525 



Thr Trp Leu Ala Cys Gly Phe Phe Gly Phe Val Cys Phe Glu Leu Ser 
530 535 540 



Phe Tyr Thr Asn Asp Phe Ser Asp Leu Thr He Phe Phe Leu He Asp 
545 550 555 560 



He Leu Leu Ser Met Val Arg Leu Leu Met Val Ser Arg Met Ala Lys 
565 570 575 



Pro Glu Glu Val Leu Val Val Glu Asn Asp Gin Gly Glu Val Val Arg 
580 585 590 



Glu Phe Met Lys Asp Thr Asp Ser He Asn Leu Tyr Lys Asn Met Arg 
595 600 605 



Glu Thr Leu Gly Lys Leu He Asn Thr Val Asn Leu Tyr Leu Tyr Lys 
610 615 620 



Ser He Leu Gly Trp Lys Tyr He Leu Gly Asn Val Leu Val Val Ser 
625 630 635 640 



Lys Tyr Phe Leu Asn Val Leu Leu Leu He Asn Lys He Leu Leu He 
645 650 655 
Ile Ala Phe Glu Thr Met Phe Phe Phe Val Val Tyr Leu Thr His Leu
660 665 670 



Asp Tyr Val Asp Thr Glu Arg lie Met Thr Glu Lys Leu His Asn Gin 
675 680 685 



Val Asn Gly Thr Glu Trp Ser Trp Lys Asn Leu Asn Thr Leu Cys Trp 
690 695 700 



Ala He Gly Ser He Ser Gly Ala Met His Glu Glu Asp Glu Lys Arg 
705 710 715 720 



Phe Leu Val Thr Val He Lys Asp Leu Leu Gly Leu Cys Glu Gin Lys 
725 730 735 



Arg Gly Lys Asp Asn Lys Ala He He Ala Ser Asn He Met Tyr He 
740 745 750 



Val Gly Gin Tyr Pro Arg Phe Leu Arg Ala His Trp Lys Phe Leu Lys 
755 760 765 



Thr Val Val Asn Lys Leu Phe Glu Phe Met His Glu Thr His Asp Gly 
770 775 780 



Val Gin Asp Met Ala Cys Asp Thr Phe lie Lys He Ala Gin Lys Cys 
785 790 795 800 



Arg Arg His Phe Val Gin Val Gin Val Gly Glu Val Met Pro Phe He 
805 810 815 



Asp Glu He Leu Asn Asn He Asn Thr He He Cys Asp Leu Gin Pro 
820 825 830 



Gin Gin Val His Thr Phe Tyr Glu Ala Val Gly Tyr Met He Gly Ala 
835 840 845 



Gin Thr Asp Gin Thr Val Gin Glu His Leu He Glu Lys Tyr Met Leu 
850 855 860 



Leu Pro Asn Gin Val Trp Asp Ser He He Gin Gin Ala Thr Lys Asn 
865 870 875 880 



Val Asp He Leu Lys Asp Pro Glu Thr Val Lys Gin Leu Gly Ser lie 
885 890 895 
Leu Lys Thr Asn Val Arg Ala Cys Lys Ala Val Gly His Pro Phe Val
900 905 910 



lie Gin Leu Gly Arg lie Tyr Leu Asp Met Leu Asn Val Tyr Lys Cys 
915 920 925 



Leu Ser Glu Asn He Ser Ala Ala He Gin Ala Asn Gly Glu Met Val 
930 935 940 



Thr Lys Gin Pro Leu He Arg Ser Met Arg Thr Val Lys Arg Glu Thr 
945 950 955 960 



Leu Lys Leu He Ser Gly Trp Val Ser Arg Ser Asn Asp Pro Gin Met 
965 970 975 



Val Ala Glu Asn Phe Val Pro Pro Leu Leu Asp Ala Val Leu He Asp 
980 985 990 



Tyr Gin Arg Asn Val Pro Ala Ala Arg Glu Pro Glu Val Leu Ser Thr 
995 1000 1005 



Met Ala He He Val Asn Lys Leu Gly Gly His He Thr Ala Glu 
1010 1015 1020 



He Pro Gin He Phe Asp Ala Val Phe Glu Cys Thr Leu Asn Met 
1025 1030 1035 



He Asn Lys Asp Phe Glu Glu Tyr Pro Glu His Arg Thr Asn Phe 
1040 1045 1050 



Phe Leu Leu Leu Gin Ala Val Asn Ser His Cys Phe Pro Ala Phe 
1055 1060 1065



Leu Ala He Pro Pro Thr Gin Phe Lys Leu Val Leu Asp Ser He 
1070 1075 1080 



lie Trp Ala Phe Lys His Thr Met Arg Asn Val Ala Asp Thr Gly 
1085 1090 1095 



Leu Gln Ile Leu Phe Thr Leu Leu Gln Asn Val Ala Gln Glu Glu
1100 1105 1110



Ala Ala Ala Gln Ser Phe Tyr Gln Thr Tyr Phe Cys Asp Ile Leu
1115 1120 1125

Gln His Ile Phe Ser Val Val Thr Asp Thr Ser His Thr Ala Gly
1130 1135 1140 



Leu Thr Met His Ala Ser He Leu Ala Tyr Met Phe Asn Leu Val 
1145 1150 1155 



Glu Glu Gly Lys He Ser Thr Ser Leu Asn Pro Gly Asn Pro Val 
1160 1165 1170 



Asn Asn Gin He Phe Leu Gin Glu Tyr Val Ala Asn Leu Leu Lys 
1175 1180 1185 



Ser Ala Phe Pro His Leu Gin Asp Ala Gin Val Lys Leu Phe Val 
1190 1195 1200 



Thr Gly Leu Phe Ser Leu Asn Gin Asp He Pro Ala Phe Lys Glu 
1205 1210 1215 



His Leu Arg Asp Phe Leu Val Gin He Lys Glu Phe Ala Gly Glu 
1220 1225 1230 



Asp Thr Ser Asp Leu Phe Leu Glu Glu Arg Glu He Ala Leu Arg 
1235 1240 1245 



Gin Ala Asp Glu Glu Lys His Lys Arg Gin Met Ser Val Pro Gly 
1250 1255 1260 



He Phe Asn Pro His Glu He Pro Glu Glu Met Cys Asp 
1265 1270 1275 



<210> 226 

<211> 632 

<212> PRT 

<213> Homo sapien 

<400> 226 

Met Tyr Cys Tyr Ser Ser Ile Lys Phe Cys Ser Leu Phe Ala Phe Glu
1 5 10 15



Thr Met Phe Phe Phe Val Val Tyr Leu Thr His Leu Asp Tyr Val Asp 
20 25 30 



Thr Glu Arg He Met Thr Glu Lys Leu His Asn Gin Val Asn Gly Thr 
35 40 45 



Glu Trp Ser Trp Lys Asn Leu Asn Thr Leu Cys Trp Ala He Gly Ser 
50 55 60 
Ile Ser Gly Ala Met His Glu Glu Asp Glu Lys Arg Phe Leu Val Thr
65 70 75 80 



Val He Lys Asp Leu Leu Gly Leu Cys Glu Gin Lys Arg Gly Lys Asp 
85 90 95 



Asn Lys Ala He He Ala Ser Asn He Met Tyr He Val Gly Gin Tyr 
100 105 110 



Pro Arg Phe Leu Arg Ala His Trp Lys Phe Leu Lys Thr Val Val Asn 
115 120 125



Lys Leu Phe Glu Phe Met His Glu Thr His Asp Gly Val Gin Asp Met 
130 135 140 



Ala Cys Asp Thr Phe He Lys He Ala Gin Lys Cys Arg Arg His Phe 
145 150 155 160 



Val Gin Val Gin Val Gly Glu Val Met Pro Phe He Asp Glu He Leu 
165 170 175 



Asn Asn He Asn Thr He He Cys Asp Leu Gin Pro Gin Gin Val His 
180 185 190 



Thr Phe Tyr Glu Ala Val Gly Tyr Met He Gly Ala Gin Thr Asp Gin 
195 200 205 



Thr Val Gin Glu His Leu He Glu Lys Tyr Met Leu Leu Pro Asn Gin 
210 215 220 



Val Trp Asp Ser He He Gin Gin Ala Thr Lys Asn Val Asp He Leu 
225 230 235 240 



Lys Asp Pro Glu Thr Val Lys Gin Leu Gly Ser He Leu Lys Thr Asn 
245 250 255 



Val Arg Ala Cys Lys Ala Val Gly His Pro Phe Val He Gin Leu Gly 
260 265 270 



Arg He Tyr Leu Asp Met Leu Asn Val Tyr Lys Cys Leu Ser Glu Asn 
275 280 285 



He Ser Ala Ala He Gin Ala Asn Gly Glu Met Val Thr Lys Gin Pro 
290 295 300 
Leu Ile Arg Ser Met Arg Thr Val Lys Arg Glu Thr Leu Lys Leu Ile
305 310 315 320 



Ser Gly Trp Val Ser Arg Ser Asn Asp Pro Gin Met Val Ala Glu Asn 
325 330 335



Phe Val Pro Pro Leu Leu Asp Ala Val Leu He Asp Tyr Gin Arg Asn 
340 345 350 

Val Pro Ala Ala Arg Glu Pro Glu Val Leu Ser Thr Met Ala He He 
355 360 365 

Val Asn Lys Leu Gly Gly His He Thr Ala Glu He Pro Gin He Phe 
370 375 380 



Asp Ala Val Phe Glu Cys Thr Leu Asn Met He Asn Lys Asp Phe Glu 
385 390 395 400 



Glu Tyr Pro Glu His Arg Thr Asn Phe Phe Leu Leu Leu Gin Ala Val 
405 410 415 



Asn Ser His Cys Phe Pro Ala Phe Leu Ala He Pro Pro Thr Gin Phe 
420 425 430 



Lys Leu Val Leu Asp Ser He He Trp Ala Phe Lys His Thr Met Arg 
435 440 445 

Asn Val Ala Asp Thr Gly Leu Gin He Leu Phe Thr Leu Leu Gin Asn 
450 455 460 

Val Ala Gin Glu Glu Ala Ala Ala Gin Ser Phe Tyr Gin Thr Tyr Phe 
465 470 475 480 



Cys Asp He Leu Gin His He Phe Ser Val Val Thr Asp Thr Ser His 
485 490 495 



Thr Ala Gly Leu Thr Met His Ala Ser He Leu Ala Tyr Met Phe Asn 
500 505 510 



Leu Val Glu Glu Gly Lys He Ser Thr Ser Leu Asn Pro Gly Asn Pro 
515 520 525 



Val Asn Asn Gin He Phe Leu Gin Glu Tyr Val Ala Asn Leu Leu Lys 
530 535 540 
Ser Ala Phe Pro His Leu Gln Asp Ala Gln Val Lys Leu Phe Val Thr
545 550 555 560 



Gly Leu Phe Ser Leu Asn Gin Asp lie Pro Ala Phe Lys Glu His Leu 
565 570 575 



Arg Asp Phe Leu Val Gin He Lys Glu Phe Ala Gly Glu Asp Thr Ser 
580 585 590 



Asp Leu Phe Leu Glu Glu Arg Glu He Ala Leu Arg Gin Ala Asp Glu 
595 600 605 



Glu Lys His Lys Arg Gin Met Ser Val Pro Gly He Phe Asn Pro His 
610 615 620 



Glu Ile Pro Glu Glu Met Cys Asp
625 630




<210> 227

<211> 159

<212> PRT

<213> Homo sapien

<400> 227



Met Glu Leu His Tyr Leu Thr His Phe Gln Ile Pro Asp Arg Ile Ser
1 5 10 15



Thr Val Arg Arg Phe Thr Leu Leu Leu Ser Phe Leu Leu Arg Arg Leu 
20 25 30 



Val Thr Leu He Ser Gin Gin Ala Thr Leu Leu Ala Ser Asn Glu Ala 
35 40 45 



Phe Lys Lys Gln Ala Glu Ser Ala Ser Glu Ala Ala Lys Lys Tyr Met
50 55 60 



Glu Glu Asn Asp Gin Leu Lys Lys Gly Ala Ala Val Asp Gly Gly Lys 
65 70 75 80 



Leu Asp Val Gly Asn Ala Glu Val Lys Leu Glu Glu Glu Asn Arg Ser 
85 90 95 



Leu Lys Ala Asp Leu Gin Lys Leu Lys Asp Glu Leu Ala Ser Thr Lys 
100 105 110 



Gin Lys Leu Glu Lys Ala Glu Asn Gin Val Leu Ala Met Arg Lys Gin 
115 120 125 
Ser Glu Gly Leu Thr Lys Glu Tyr Asp Arg Leu Leu Glu Glu His Ala
130 135 140 



Lys Leu Gln Ala Ala Val Asp Gly Pro Met Asp Lys Lys Glu Glu
145 150 155




<210> 228

<211> 313

<212> PRT

<213> Homo sapien

<400> 228



Met Gly Ala Glu Ala Ser Ser Ser Trp Cys Pro Gly Thr Ala Leu Pro 
1 5 10 15 



Glu Glu Arg Leu Ser Val Lys Arg Ala Ser Glu lie Ser Gly Phe Leu 
20 25 30 



Gly Gin Gly Ser Ser Gly Glu Ala Ala Leu Asp Val Leu Thr His Val 
35 40 45 



Leu Glu Gly Ala Gly Asn Lys Leu Thr Ser Ser Cys Gly Lys Pro Ser 
50 55 60 



Ser Asn Arg Met Ser Leu Gin Trp Thr Ala Val Ala Thr Phe Leu Tyr 
65 70 75 80 



Ala Glu Val Phe Val Val Leu Leu Leu Cys lie Pro Phe lie Ser Pro 
85 90 95 



Lys Arg Trp Gin Lys He Phe Lys Ser Arg Leu Val Glu Leu Leu Val 
100 105 110 



Ser Tyr Gly Asn Thr Phe Phe Val Val Leu He Val He Leu Val Leu 
115 120 125 



Leu Val Ile Asp Ala Val Arg Glu Ile Arg Lys Tyr Asp Asp Val Thr
130 135 140



Glu Lys Val Asn Leu Gin Asn Asn Pro Gly Ala Met Glu His Phe His 
145 150 155 160 



Met Lys Leu Phe Arg Ala Gin Arg Asn Leu Tyr He Ala Gly Phe Ser 
165 170 175 
Leu Leu Leu Ser Phe Leu Leu Arg Arg Leu Val Thr Leu Ile Ser Gln
180 185 190 



Gin Ala Thr Leu Leu Ala Ser Asn Glu Ala Phe Lys Lys Gin Ala Glu 
195 200 205 



Ser Ala Ser Glu Ala Ala Lys Lys Tyr Met Glu Glu Asn Asp Gin Leu 
210 215 220 



Lys Lys Gly Ala Ala Val Asp Gly Gly Lys Leu Asp Val Gly Asn Ala 
225 230 235 240 



Glu Val Lys Leu Glu Glu Glu Asn Arg Ser Leu Lys Ala Asp Leu Gin 
245 250 255 



Lys Leu Lys Asp Glu Leu Ala Ser Thr Lys Gin Lys Leu Glu Lys Ala 
260 265 270 



Glu Asn Gin Val Leu Ala Met Arg Lys Gin Ser Glu Gly Leu Thr Lys 
275 280 285 



Glu Tyr Asp Arg Leu Leu Glu Glu His Ala Lys Leu Gin Ala Ala Val 
290 295 300 



Asp Gly Pro Met Asp Lys Lys Glu Glu
305 310




<210> 229

<211> 301

<212> PRT

<213> Homo sapien

<400> 229



Met Ser Leu Gin Trp Thr Ala Val Ala Thr Phe Leu Tyr Ala Glu Val 
1 5 10 15



Phe Val Val Leu Leu Leu Cys Ile Pro Phe Ile Ser Pro Lys Arg Trp
20 25 30



Gin Lys He Phe Lys Ser Arg Leu Val Glu Leu Leu Val Ser Tyr Gly 
35 40 45 



Asn Thr Phe Phe Val Val Leu He Val He Leu Val Leu Leu Val He 
50 55 60 



Asp Ala Val Arg Glu He Arg Lys Tyr Asp Asp Val Thr Glu Lys Val 
65 70 75 80 
Asn Leu Gln Asn Asn Pro Gly Ala Met Glu His Phe His Met Lys Leu
85 90 95 



Phe Arg Ala Gln Arg Asn Leu Tyr Ile Ala Gly Phe Ser Leu Leu Leu
100 105 110



Ser Phe Leu Leu Arg Arg Leu Val Thr Leu He Ser Gin Gin Ala Thr 
115 120 125 



Leu Leu Ala Ser Asn Glu Ala Phe Lys Lys Gln Ala Glu Ser Ala Ser
130 135 140



Glu Ala Ala Lys Lys Tyr Met Glu Glu Asn Asp Gln Leu Lys Lys Gly
145 150 155 160



Ala Ala Val Asp Gly Gly Lys Leu Asp Val Gly Asn Ala Glu Val Lys 
165 170 175 



Leu Glu Glu Glu Asn Arg Ser Leu Lys Ala Asp Leu Gin Lys Leu Lys 
180 185 190 



Asp Glu Leu Ala Ser Thr Lys Gin Lys Leu Glu Lys Ala Glu Asn Gin 
195 200 205 



Val Leu Ala Met Arg Lys Gin Ser Glu Gly Leu Thr Lys Glu Tyr Asp 
210 215 220 



Arg Leu Leu Glu Glu His Ala Lys Leu Gin Ser Pro Ala Ser Pro Ser 
225 230 235 240 



Ser Thr Ser Val Cys Ala Leu Leu Leu Pro Pro Phe Pro Ser Thr Ala 
245 250 255 



His Ser Ser Ser Ser Arg Pro Leu Ser Thr Leu Ser Lys His He Thr 
260 265 270 



Gly Asp Leu He Ala Thr Arg Ser Glu Cys Val Cys Cys His Pro Ala 
275 280 285 



Trp Pro Gly Gin Ala Trp His Ser Leu Gly Phe His Ala 
290 295 300 



<210> 230

<211> 98

<212> PRT

<213> Homo sapien

<220>

<221> MISC_FEATURE

<222> (2)..(2)

<223> X=any amino acid



<220>

<221> MISC_FEATURE

<222> (25)..(25)

<223> X=any amino acid



<220>

<221> MISC_FEATURE

<222> (29)..(29)

<223> X=any amino acid



<220>

<221> MISC_FEATURE

<222> (68)..(68)

<223> X=any amino acid



<220>

<221> MISC_FEATURE

<222> (73)..(73)

<223> X=any amino acid



<400> 230

Ile Xaa Ser Pro Phe Ser Tyr Thr Pro Pro Phe Pro Pro Thr Gly Gly
1 5 10 15 



Ala Ala Leu Ser Ala Arg Gly Ile Xaa Gly Pro Glu Xaa Glu Pro Ala
20 25 30 



Ala Ala Ala Val Ala Arg Gly Lys Arg Glu Glu Ala Ala Thr Gly Glu 
35 40 45 



Lys Lys Arg Ser Arg His Arg Arg Glu Val Asp Thr Gln Ala Ala Ala
50 55 60 



Ala Ala Arg Xaa Ala Arg Pro Pro Xaa Arg Pro Ser Pro Arg Pro Val 
65 70 75 80 



His Lys Ala Ala Arg Leu Arg Lys Gly Val Arg Thr Lys Gly Gln Ser
85 90 95 



Leu Gln

<210> 231

<211> 130 

<212> PRT 

<213> Homo sapien 

<220>

<221> MISC_FEATURE

<222> (2)..(2)

<223> X=any amino acid



<220>

<221> MISC_FEATURE

<222> (4)..(4)

<223> X=any amino acid



<220>

<221> MISC_FEATURE

<222> (25)..(25)

<223> X=any amino acid



<220>

<221> MISC_FEATURE

<222> (29)..(29)

<223> X=any amino acid



<220>

<221> MISC_FEATURE

<222> (67)..(67)

<223> X=any amino acid



<220>

<221> MISC_FEATURE

<222> (73)..(73)

<223> X=any amino acid



<220>

<221> MISC_FEATURE

<222> (128)..(128)

<223> X=any amino acid



<400> 231 

Phe Xaa Pro Xaa Phe Pro Thr Pro Pro Pro Ser Pro Arg Pro Glu Glu 
1 5 10 15 



Pro Leu Phe Pro Arg Gly Ala Phe Xaa Gly Pro Arg Xaa Ser Pro Pro 
20 25 30



Pro Pro Pro Ser Pro Glu Gly Ser Glu Lys Arg Pro Arg Pro Glu Arg 
35 40 45 
Lys Ser Gly Val Ala Thr Gly Glu Lys Ser Thr Pro Lys Gln Gln Pro
50 55 60 



Pro Pro Xaa Arg Pro Ala His Gln Xaa Ala Arg Pro Pro Ala Pro Ser
65 70 75 80 



Thr Lys Gln Pro Ala Ser Ala Lys Gly Tyr Glu Arg Arg Asp Lys Val
85 90 95



Ser Asn Glu Lys Ser Val Glu Asp Arg Lys Glu Val Lys Glu Lys Lys 
100 105 110



Glu Lys Thr Lys Gln Thr Ile Ser Cys Pro Gly Gly Ser Glu Lys Xaa
115 120 125 



Lys His 
130 



<210> 232 

<211> 260 

<212> PRT 

<213> Homo sapien 

<400> 232 

Met Asp Met Ser Leu Glu Leu Asp lie lie Lys Leu Asn Arg Ser Gin 
1 5 10 15 



Arg Gly Gly Trp Gly Gly Cys Arg Gly Leu Arg Ala Val Cys Arg Leu 
20 25 30 



Pro Val Ala Gly Arg Val Trp Trp Val Leu Gin Ala Ala Ala Gin Val 
35 40 45 



Asn Arg Gly Gly Gly Pro He Arg Asn Arg Pro Gly He Ala Leu Arg 
50 55 60 



Ala Ala Val Glu Gly Gly Arg Asn Arg Pro Ala Pro Tyr Ser Arg Pro 
65 70 75 80 



Lys Gin Leu Pro Asp Lys Trp Gin His Asp Leu Phe Asp Ser Gly Phe 
85 90 95 



Gly Gly Gly Ala Gly Val Glu Thr Gly Gly Lys Leu Cys Val Val Ser 
100 105 110 



Asn Leu Asp Ile Gly Val Ser Asp Ala Asp Ile Gln Glu Leu Phe Ala
115 120 125



Glu Phe Gly Thr Leu Lys Lys Ala Ala Val His Tyr Asp Arg Ser Gly 
130 135 140 



Arg Ser Leu Gly Thr Ala Asp Val His Phe Glu Arg Lys Ala Asp Ala 
145 150 155 160



Leu Lys Ala Met Lys Gin Tyr Asn Gly Val Pro Leu Asp Gly Arg Pro 
165 170 175 



Met Asn lie Gin Leu Val Thr Ser Gin lie Asp Ala Gin Arg Arg Pro 
180 185 190 



Ala Gin Ser Val Asn Arg Gly Gly Met Thr Arg Asn Arg Gly Ala Gly 
195 200 205 



Gly Met Gly Glu Leu Val Glu Ala Pro Arg Arg Arg His Cys Ser Asp 
210 215 220 



Gly Phe Cys Gly Arg Gly Arg Gly Ala Gly Arg Asn Ser Lys Gin Gin 
225 230 235 240



Leu Ser Ala Glu Glu Leu Asp Ala Gin Leu Asp Ala Tyr Asn Ala Arg 
245 250 255 



Met Asp Thr Ser 
260 



<210> 233 

<211> 243 

<212> PRT 

<213> Homo sapien 

<220>

<221> MISC_FEATURE

<222> (28)..(28)

<223> X=any amino acid



<220>

<221> MISC_FEATURE

<222> (34)..(34)

<223> X=any amino acid



<220>

<221> MISC_FEATURE

<222> (47)..(47)

<223> X=any amino acid



<220>

<221> MISC_FEATURE

<222> (54)..(54)

<223> X=any amino acid



<220>

<221> MISC_FEATURE

<222> (101)..(101)

<223> X=any amino acid



<400> 233 



Thr Glu Pro Glu Pro Ala Arg Arg Leu Gly Arg Val Pro Gly Pro Ala 
1 5 10 15 



Cys Arg Val Pro Ala Pro Ser Cys Gly Pro Arg Xaa Trp Gly Ala Ala 
20 25 30



Ala Xaa Val Asn Arg Gly Gly Gly Pro Ser Gly Thr Gly Arg Xaa Ser 
35 40 45 



Pro Ala Ala Pro Gly Xaa Arg Gin Glu Pro Thr Gly Ala Leu Gin Gin 
50 55 60 



Ala Lys Thr Thr Ser Arg Gin Val Ala Ala Arg Ser Phe Arg Gin Trp 
65 70 75 80



Leu Arg Arg Trp Cys Arg Arg Gly Asp Arg Trp Glu Thr Val Arg Gly 
85 90 95 



Val His Leu Asp Xaa Gly Val Ser Asp Ala Asp Ile Gln Glu Leu Phe
100 105 110



Ala Glu Phe Gly Thr Leu Lys Lys Ala Ala Val His Tyr Asp Arg Ser 
115 120 125 



Gly Arg Ser Leu Gly Thr Ala Asp Val His Phe Glu Arg Lys Ala Asp 
130 135 140 



Ala Leu Lys Ala Met Lys Gln Tyr Asn Gly Val Pro Leu Asp Gly Arg
145 150 155 160



Pro Met Asn He Gin Leu Val Thr Ser Gin He Asp Ala Gin Arg Arg 
165 170 175 



Pro Ala Gin Ser Val Asn Arg Gly Gly Met Thr Arg Asn Arg Gly Ala 
180 185 190

Gly Gly Ile Gly Gly Asp Gly Gly Thr Arg Arg Gly Asn Ala Arg Ile
195 200 205 



Leu Cys Gly Arg Gly Arg Gly Ala Gly Arg Asn Ser Lys Gin Gin Leu 
210 215 220 



Ser Ala Glu Glu Leu Asp Ala Gin Leu Asp Ala Tyr Asn Ala Arg Met 
225 230 235 240 



Asp Thr Ser 



<210> 234 

<211> 428

<212> PRT 

<213> Homo sapien 

<400> 234 



Met Thr Val Ala Glu Ile Lys Gly Thr Thr Gly Arg Thr Tyr Val Gly
1 5 10 15



Ser His Ala Gin Gly Arg Ser Ser Lys Cys Leu Lys Lys Thr Lys Thr 
20 25 30 



Glu Asn Pro Thr Asp Pro His Gin Thr Arg Trp Thr Arg Ser Gly Gin 
35 40 45 



Arg Leu His Arg Gly Thr Gin Ser Val Arg Gin His Ala Trp Ser Met 
50 55 60 



His Gly Pro Lys Ser Lys Asn Ala Lys His Ser Met Asp Gin His Thr 
65 70 75 80 



His Gly Gin Cys His Val Asp He Val Thr Arg Cys Met Phe He Cys 
85 90 95 



Thr Gly Gin Thr Val Thr Gly His His Pro Thr Ser Thr Ala Ala Glu 
100 105 110 



Thr Ser Met Glu Met He Asn Ser Val Arg Ala Thr Val Ala Gin Arg 
115 120 125 



Glu Ser Met Cys His Leu Arg Glu Ala Ala Tyr Thr Val Val Pro Thr 
130 135 140 
Gly Ile Ala His His Val Leu Ala Leu Asp Glu Ser Lys Ala Lys Leu
145 150 155 160 



Ser Ser Asp Val Leu Thr Leu Leu lie Lys Gin Tyr Cys Arg Glu Ser 
165 170 175 



Gly Val Arg Asn Leu Gin Lys Gin Val Glu Lys Val Leu Arg Lys Ser 
180 185 190 



Ala Tyr Lys He Val Ser Gly Glu Ala Glu Ser Val Glu Val Thr Pro 
195 200 205 



Glu Asn Leu Gin Asp Phe Val Gly Lys Pro Val Phe Thr Val Glu Arg 
210 215 220 



Met Tyr Asp Val Thr Pro Pro Gly Val Val Met Gly Leu Ala Trp Thr 
225 230 235 240



Ala Met Gly Gly Ser Thr Leu Phe Val Glu Thr Ser Leu Arg Arg Pro 
245 250 255 



Gin Asp Lys Asp Ala Lys Gly Asp Lys Asp Gly Ser Leu Glu Val Thr 
260 265 270 



Gly Gin Leu Gly Glu Val Met Lys Glu Ser Ala Arg He Ala Tyr Thr 
275 280 285 



Phe Ala Arg Ala Phe Leu Met Gin His Ala Pro Ala Asn Asp Tyr Leu 
290 295 300 



Val Thr Ser His He His Leu His Val Pro Glu Gly Ala Thr Pro Lys 
305 310 315 320 



Asp Gly Pro Ser Ala Gly Cys Thr He Val Thr Ala Leu Leu Ser Leu 
325 330 335 



Ala Met Gly Arg Pro Val Arg Gin Asn Leu Ala Met Thr Gly Glu Val 
340 345 350



Ser Leu Thr Gly Lys He Leu Pro Val Gly Gly He Lys Glu Lys Thr 
355 360 365 



He Ala Ala Lys Arg Ala Gly Val Thr Cys He Val Leu Pro Ala Glu 
370 375 380 



Asn Lys Lys Asp Phe Tyr Asp Leu Ala Ala Phe Ile Thr Glu Gly Leu
385 390 395 400



Glu Val His Phe Val Glu His Tyr Arg Glu He Phe Asp He Ala Phe 
405 410 415 



Pro Asp Glu Gin Ala Glu Ala Leu Ala Val Glu Arg 
420 425 



<210> 235

<211> 287

<212> PRT

<213> Homo sapien

<220>

<221> MISC_FEATURE

<222> (4)..

<223> X=any amino acid

<400> 235



Leu Val Pro Xaa Xaa Ser Arg Pro Cys Val Ala Leu Asp Glu Ser Lys 
1 5 10 15



Ala Lys Leu Ser Ser Asp Val Leu Thr Leu Leu He Lys Gin Tyr Cys 
20 25 30 



Arg Glu Ser Gly Val Arg Asn Leu Gin Lys Gin Val Glu Lys Val Leu 
35 40 45 



Arg Lys Ser Ala Tyr Lys He Val Ser Gly Glu Ala Glu Ser Val Glu 
50 55 60 



Val Thr Pro Glu Asn Leu Gin Asp Phe Val Gly Lys Pro Val Phe Thr 
65 70 75 80 



Val Glu Arg Met Tyr Asp Val Thr Pro Pro Gly Val Val Met Gly Leu 
85 90 95 



Ala Trp Thr Ala Met Gly Gly Ser Thr Leu Phe Val Glu Thr Ser Leu 
100 105 110 



Arg Arg Pro Gin Asp Lys Asp Ala Lys Gly Asp Lys Asp Gly Ser Leu 
115 120 125 



Glu Val Thr Gly Gin Leu Gly Glu Val Met Lys Glu Ser Ala Arg He 
130 135 140 



Ala Tyr Thr Phe Ala Arg Ala Phe Leu Met Gln His Ala Pro Ala Asn
145 150 155 160



Asp Tyr Leu Val Thr Ser His He His Leu His Val Pro Glu Gly Ala 
165 170 175 



Thr Pro Lys Asp Gly Pro Ser Ala Gly Cys Thr He Val Thr Ala Leu 
180 185 190 



Leu Ser Leu Ala Met Gly Arg Pro Val Arg Gin Asn Leu Ala Met Thr 
195 200 205 



Gly Glu Val Ser Leu Thr Gly Lys He Leu Pro Val Gly Gly He Lys 
210 215 220 



Glu Lys Thr He Ala Ala Lys Arg Ala Gly Val Thr Cys He Val Leu 
225 230 235 240 



Pro Ala Glu Asn Lys Lys Asp Phe Tyr Asp Leu Ala Ala Phe He Thr 
245 250 255 



Glu Gly Leu Glu Val His Phe Val Glu His Tyr Arg Glu He Phe Asp 
260 265 270 



Ile Ala Phe Pro Asp Glu Gln Ala Glu Ala Leu Ala Val Glu Arg
275 280 285


<210> 236

<211> 247

<212> PRT

<213>

<400> 236



Met Ser Leu Ala Ile Leu Glu Glu Val Ile Met Trp Asp Pro Arg Val
1 5 10 15



Ser Arg Pro Cys Leu Pro Leu Gly His Val Ala Thr Gin Tyr Phe Ala 
20 25 30 



Glu Gin Gly His Ala Ser Val Gly Trp Ala Glu Tyr Pro Glu Glu Asp 
35 40 45 



Leu Lys Arg Thr Met Met Ala Cys Gly Gly Ser Asn Pro Asp Gin Cys 
50 55 60 



Glu Gly Ser Val Ser Arg Leu Cys Leu Ala Arg Cys Gin Val Phe Glu 
65 70 75 80 
Glu Thr Gln Leu Gly Gly Glu Arg Tyr Asn Leu Phe Asp Trp Leu Pro
85 90 95 



Glri Ala Lys Thr Cys Thr Phe lie Leu Arg Gly Gly Thr Glu Gin Phe 
100 105 110 



Met Glu Glu Thr Glu Arg Ser Leu His Asp Ala He Met He Val Met 
115 120 125 



Arg Ala He Lys Asn Asp Ser Val Val Ala Gly Gly Gly Ala He Glu 
130 135 140 



Met Glu Leu Ser Lys Tyr Leu Arg Asp Tyr Ser Arg Thr He Pro Gly 
145 150 155 160 



Lys Gin Gin Leu Leu lie Gly Ala Tyr Ala Lys Ala Leu Glu lie He 
165 170 175 



Pro Arg Gin Leu Cys Asp Asn Ala Gly Phe Asp Ala Thr Asn lie Leu 
180 185 190 



Asn Lys Leu Arg Ala Arg His Ala Gin Gly Gly Thr Trp Tyr Gly Val 
195 200 205 



Asp lie Asn Asn Glu Asp lie Ala Asp Asn Phe Glu Ala Phe Val Trp 
210 215 220 



Glu Pro Ala Met Val Arg He Asn Ala Leu Thr Ala Ala Ser Glu Ala 
225 230 . 235 240 



Ala Val Pro Asp Arg Val Leu 
245 



<210> 237 

<211> 154 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE

<222> (35)..(35)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (51)..(51)

<223> X=any amino acid

<400> 237

Leu Leu Pro Gln Ala Lys Thr Cys Thr Phe Ile Leu Arg Gly Gly Thr
1 5 10 15

Glu Gln Phe Met Glu Glu Thr Glu Arg Ser Leu His Asp Ala Ile Met
20 25 30

Ile Val Xaa Arg Ala Ile Lys Asn Asp Ser Val Val Ala Gly Gly Gly
35 40 45

Ala Ile Xaa Met Glu Leu Ser Lys Tyr Leu Arg Asp Tyr Ser Arg Thr
50 55 60

Ile Pro Gly Lys Gln Gln Leu Leu Ile Gly Ala Tyr Ala Lys Ala Leu
65 70 75 80

Glu Ile Ile Pro Arg Gln Leu Cys Asp Asn Ala Gly Phe Asp Ala Thr
85 90 95

Asn Ile Leu Asn Lys Leu Arg Ala Arg His Ala Gln Gly Gly Thr Trp
100 105 110

Tyr Gly Val Asp Ile Asn Asn Glu Asp Ile Ala Asp Asn Phe Glu Ala
115 120 125

Phe Val Trp Glu Pro Ala Met Val Arg Ile Asn Ala Leu Thr Ala Ala
130 135 140

Ser Glu Ala Ala Val Pro Asp Arg Val Leu
145 150



<210> 238 

<211> 269 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE 

<222> (18)..(18)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (27)..(27)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (31)..(32)

<223> X=any amino acid



<220>

<221> MISC_FEATURE

<222> (37)..(37)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (41)..(41)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (45)..(48)

<223> X=any amino acid



<220> 

<221> MISC_FEATURE

<222> (51)..(51)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (55)..(55)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (58)..(58)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (61)..(61)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (69)..(69)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (71)..(71)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (76)..(76)

<223> X=any amino acid



<220> 

<221> MISC_FEATURE

<222> (89)..(90)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (94)..(94)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (99)..(100)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (114)..(114)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (116)..(116)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (123)..(123)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (126)..(126)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (131)..(131)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (239)..(239)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (263)..(263)

<223> X=any amino acid



<400> 238 



Met Ala Met Asp Leu Cys Arg Gln Asp Pro Glu Cys Glu Phe Tyr Phe
1 5 10 15

Ser Xaa Asp Ala Asp Ala Val Leu Thr Asn Xaa Gln Thr Leu Xaa Xaa
20 25 30

Leu Ile Glu Glu Xaa Arg Gly Val Xaa Ala Pro Met Xaa Xaa Xaa Xaa
35 40 45

Gly Lys Xaa Trp Ser Asn Xaa Trp Gly Xaa Leu Ser Xaa Asp Glu Tyr
50 55 60

Tyr Ala Arg Ser Xaa Asp Xaa Val Glu Leu Val Xaa Arg Lys Asp Val
65 70 75 80

Gly Val Trp Asn Val Pro Tyr Ile Xaa Xaa Ala Tyr Val Xaa Gly Gly
85 90 95

Asp Thr Xaa Xaa Met Glu Leu Pro Gln Arg Asp Val Phe Ser Gly Ser
100 105 110

Asp Xaa Asp Xaa Asp Met Ala Phe Cys Glu Xaa Phe Gly Xaa Lys Gly
115 120 125

Ile Phe Xaa His Leu Ser Tyr Gln His Glu Phe Gly Arg Leu Leu Ala
130 135 140

Thr Ser Arg Tyr Asp Thr Glu His Leu His Pro Asp Leu Trp Gln Ile
145 150 155 160

Phe Asp Asn Pro Val Asp Trp Lys Glu Gln Tyr Ile His Glu Asn Tyr
165 170 175

Ser Arg Ala Leu Glu Gly Lys Asp Ile Val Glu Gln Pro Cys Pro Asp
180 185 190

Val Tyr Trp Phe Pro Leu Leu Ser Glu Gln Met Cys Asp Glu Leu Val
195 200 205

Ala Glu Met Glu His Tyr Gly Gln Trp Ser Gly Gly Arg His Glu Asp
210 215 220

Ser Arg Leu Ala Gly Gly Tyr Glu Asn Val Pro Thr Val Asp Xaa Pro
225 230 235 240

His Glu Ala Gly Gly Val Arg Gly Pro Val Ala Ala Ala Ala Ala Asp
245 250 255

Val Cys Gly Pro Met Thr Xaa Thr Cys Phe Arg Phe Thr
260 265



<210> 239 

<211> 138 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE 

<222> (1)..(1)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (3)..(4)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (9)..(9)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (117)..(117)

<223> X=any amino acid 



<400> 239 

Xaa Phe Xaa Xaa Lys Gly Ile Phe Xaa His Leu Ser Tyr Gln His Glu
1 5 10 15

Phe Gly Arg Leu Leu Ala Thr Ser Arg Tyr Asp Thr Glu His Leu His
20 25 30

Pro Asp Leu Trp Gln Ile Phe Asp Asn Pro Val Asp Trp Lys Glu Gln
35 40 45

Tyr Ile His Glu Asn Tyr Ser Arg Ala Leu Glu Gly Lys Glu Ser Trp
50 55 60

Ser Ser His Ala Arg Thr Cys Thr Gly Ser His Cys Cys Gln Asn Lys
65 70 75 80

Cys Val Met Ser Trp Trp Gln Arg Trp Ser Thr Thr Ala Ser Gly Gln
85 90 95

Ala Ala Gly Met Arg Ile Gln Gly Trp Leu Glu Ala Thr Arg Met Cys
100 105 110

Pro Pro Trp Thr Xaa His Met Lys Gln Val Gly Tyr Glu Asp Gln Trp
115 120 125

Leu Gln Leu Leu Arg Thr Tyr Val Gly Pro
130 135



<210> 240 

<211> 233 

<212> PRT 

<213> Homo sapien 

<400> 240 

Met Lys Met Val Ala Pro Trp Thr Arg Phe Tyr Ser Asn Ser Cys Cys
1 5 10 15

Leu Cys Cys His Val Arg Thr Gly Thr Ile Leu Leu Gly Val Trp Tyr
20 25 30

Leu Ile Ile Asn Ala Val Val Leu Leu Ile Leu Leu Ser Ala Leu Ala
35 40 45

Asp Pro Asp Gln Tyr Asn Phe Ser Ser Ser Glu Leu Gly Gly Asp Phe
50 55 60

Glu Phe Met Asp Asp Ala Asn Met Cys Ile Ala Ile Ala Ile Ser Leu
65 70 75 80

Leu Met Ile Leu Ile Cys Ala Met Ala Thr Tyr Gly Ala Tyr Lys Gln
85 90 95

Arg Ala Ala Trp Ile Ile Pro Phe Phe Cys Tyr Gln Ile Phe Asp Phe
100 105 110

Ala Leu Asn Met Leu Val Ala Ile Thr Val Leu Ile Tyr Pro Asn Ser
115 120 125

Ile Gln Glu Tyr Ile Arg Gln Leu Pro Pro Asn Phe Pro Tyr Arg Asp
130 135 140

Asp Val Met Ser Val Asn Pro Thr Cys Leu Val Leu Ile Ile Leu Leu
145 150 155 160

Phe Ile Ser Ile Ile Leu Thr Phe Lys Gly Tyr Leu Ile Ser Cys Val
165 170 175

Trp Asn Cys Tyr Arg Tyr Ile Asn Gly Arg Asn Ser Ser Asp Val Leu
180 185 190

Val Tyr Val Thr Ser Asn Asp Thr Thr Val Leu Leu Pro Pro Tyr Asp
195 200 205

Asp Ala Thr Val Asn Gly Ala Cys Gln Gly Ala Pro Ala Pro Tyr Val
210 215 220

Leu Pro Lys Pro Ser Ser Gly Arg Thr
225 230



<210> 241

<211> 319

<212> PRT

<213> Homo sapien

<220>

<221> MISC_FEATURE

<222> (2)..(5)

<223> X=any amino acid

<400> 241


Val Xaa Xaa Xaa Xaa Ser
1 5 10 15



Arg Val Pro Thr Arg Gln Lys Leu Gly Ala Leu Gly Val Ser Arg Arg
20 25 30

Gln Ala Arg Gly Arg Thr Gly Glu Arg Ala Gly Ser Arg Ser Gly Gly
35 40 45

Gly Ala Gly Ser Ser Gly Ala Ala Gly Ser Arg Arg Gly Gly Leu Gln
50 55 60

Ala Arg Arg Ser Thr Leu Leu Lys Thr Cys Ala Arg Ala Arg Ala Thr
65 70 75 80

Ala Pro Gly Ala Met Lys Met Val Ala Pro Trp Thr Arg Phe Tyr Ser
85 90 95

Asn Ser Cys Cys Leu Cys Cys His Val Arg Thr Gly Thr Ile Leu Leu
100 105 110

Gly Val Trp Tyr Leu Ile Ile Asn Ala Val Val Leu Leu Ile Leu Leu
115 120 125

Ser Ala Leu Ala Asp Pro Asp Gln Tyr Asn Phe Ser Ser Ser Glu Leu
130 135 140

Gly Gly Asp Phe Glu Phe Met Asp Asp Ala Asn Met Cys Ile Ala Ile
145 150 155 160

Ala Ile Ser Leu Leu Met Ile Leu Ile Cys Ala Met Ala Thr Tyr Gly
165 170 175

Ala Tyr Lys Gln Arg Ala Ala Trp Ile Ile Pro Phe Phe Cys Tyr Gln
180 185 190

Ile Phe Asp Phe Ala Leu Asn Met Leu Val Ala Ile Thr Val Leu Ile
195 200 205

Tyr Pro Asn Ser Ile Gln Glu Tyr Ile Arg Gln Leu Pro Pro Asn Phe
210 215 220

Pro Tyr Arg Asp Asp Val Met Ser Val Asn Pro Thr Cys Leu Val Leu
225 230 235 240

Ile Ile Leu Leu Phe Ile Ser Ile Ile Leu Thr Phe Lys Gly Tyr Leu
245 250 255

Ile Ser Cys Val Trp Asn Cys Tyr Arg Tyr Ile Asn Gly Arg Asn Ser
260 265 270

Ser Asp Val Leu Val Tyr Val Thr Ser Asn Asp Thr Thr Val Leu Leu
275 280 285

Pro Pro Tyr Asp Asp Ala Thr Val Asn Gly Ala Cys Gln Gly Ala Pro
290 295 300

Ala Leu Thr Cys Cys Leu Ser Leu Gln Val Gly Gly Pro Glu Gly
305 310 315



<210> 242 

<211> 76 

<212> PRT 

<213> Homo sapien

<400> 242

Met Asp Ser Ser Gly Ala Arg Ala Gly Thr Val Cys Thr Leu His Gly 
1 5 10 15 



Ile His Arg Gly His Gln Lys Pro Gly Leu Phe Leu Ser Arg Ser Ser
20 25 30

Asn Val Phe Ser Ser Asp Ser His His Leu Gln Leu Val Pro Thr Gln
35 40 45

Cys Cys His Ser Arg Leu Thr Gln Val Leu Arg Pro Ser Thr Ser Ser
50 55 60

Ala Ala Arg Ala His Leu Arg Leu Asp Cys Thr Ile
65 70 75



<210> 243 

<211> 72 

<212> PRT 

<213> Homo sapien 

<400> 243 

Arg Ser Gly Cys Leu Ile Tyr Gly Leu Ala Leu Asp Ala Pro Ser Gly
1 5 10 15

Arg Ser Val Met Asp Ser Ser Gly Ala Arg Ala Gly Thr Val Cys Thr
20 25 30

Leu His Gly Ile His Arg Gly His Gln Lys Pro Gly Leu Phe Leu Ser
35 40 45

Arg Ser Ser Asn Ala Ser Ala Ala Ile Ala Thr Thr Tyr Ser Ser Ser
50 55 60



Pro His Ser Ala Ala Thr Ala Gly 
65 70 



<210> 244 

<211> 61 

<212> PRT 

<213> Homo sapien 

<400> 244 

Met Gly Phe Ile Glu Ala Thr Arg Asn Gln Ala Phe Ser Cys Pro Glu
1 5 10 15

Val Pro Ser Phe Ser Ser Asp Ser His His Leu Gln Leu Val Pro Thr
20 25 30

Gln Cys Cys His Ser Arg Leu Thr Gln Val Leu Arg Pro Ser Thr Ser
35 40 45

Ser Ala Ala Arg Ala His Leu Arg Leu Asp Cys Thr Ile
50 55 60



<210> 245 

<211> 57 

<212> PRT 

<213> Homo sapien 

<400> 245 

Arg Pro Pro Glu Thr Arg Pro Phe Leu Val Gln Lys Phe Gln Arg Phe
1 5 10 15

Ser Ser Asp Ser His His Leu Gln Leu Val Pro Thr Gln Cys Cys His
20 25 30

Ser Arg Leu Thr Gln Val Leu Arg Pro Ser Thr Ser Ser Ala Ala Arg
35 40 45

Ala His Leu Arg Leu Asp Cys Thr Ile
50 55



<210> 246 

<211> 267 

<212> PRT 

<213> Homo sapien 



<220> 

<221> MISC_FEATURE 

<222> (263)..(264)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (267)..(267)

<223> X=any amino acid 



<400> 246 

Ala Asp Glu Thr Ser Glu Lys Glu Gln Gln Glu Ala Ile Glu His Ile
1 5 10 15

Asp Glu Val Gln Asn Glu Ile Asp Arg Leu Asn Glu Gln Ala Ser Glu
20 25 30

Glu Ile Leu Lys Val Glu Gln Lys Tyr Asn Lys Leu Arg Gln Pro Phe
35 40 45

Val Gln Lys Arg Ser Glu Leu Ile Ala Lys Ile Pro Thr Phe Leu Gly
50 55 60

Asn Asn Ile Cys Gln Pro Ser Thr Ser Val Cys Thr Ala Leu Arg Glu
65 70 75 80

Glu Asp Glu Glu Ala Leu His Tyr Leu Thr Arg Val Glu Val Thr Glu
85 90 95

Phe Glu Asp Ile Lys Ser Gly Tyr Arg Ile Asp Phe Asp Phe Asp Gly
100 105 110

Asn Pro Tyr Phe Glu Asn Lys Val Leu Ser Lys Glu Phe His Leu Asn
115 120 125

Glu Ser Gly Asp Pro Ser Ser Lys Ser Thr Glu Ile Lys Trp Lys Ser
130 135 140

Gly Lys Asp Leu Thr Lys Arg Ser Ser Gln Thr Gln Asn Lys Ala Ser
145 150 155 160

Arg Lys Arg Gln His Glu Glu Pro Glu Ser Phe Phe Thr Trp Phe Thr
165 170 175

Asp His Ser Asp Ala Gly Ala Asp Glu Leu Gly Glu Val Ile Lys Asp
180 185 190

Asp Ile Trp Pro Asn Pro Leu Gln Tyr Tyr Leu Val Pro Asp Met Asp
195 200 205

Asp Glu Glu Gly Glu Gly Glu Glu Asp Asp Asp Asp Asp Glu Glu Glu
210 215 220

Glu Gly Leu Glu Asp Ile Ala Lys Asn Gly Asp Glu Asp Glu Gly Asp
225 230 235 240

Gly Met Met Lys Ile Asp Asp Glu Val Gly Gly Thr Glu Gln Gly Gly
245 250 255



Leu Lys Lys Lys Phe Leu Xaa Xaa Pro Pro Xaa 
260 265 



<210> 247

<211> 148

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE

<222> (3)..(3)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (17)..(17)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (134)..(134)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (136)..(136)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (138)..(139)

<223> X=any amino acid

<220>

<221> MISC_FEATURE

<222> (141)..(141)

<223> X=any amino acid 



<400> 247 

Ser Phe Xaa Phe Asp Gly Asn Pro Tyr Phe Glu Asn Lys Val Leu Ser 
1 5 10 15 

Xaa Glu Phe His Leu Asn Glu Ser Gly Asp Pro Ser Ser Lys Ser Thr 
20 25 30 

Glu Ile Lys Trp Lys Ser Gly Lys Asp Leu Thr Lys Arg Ser Ser Gln
35 40 45

Thr Gln Asn Lys Ala Ser Arg Lys Arg Gln His Glu Glu Pro Glu Ser
50 55 60

Phe Phe Thr Trp Phe Thr Asp His Ser Asp Ala Gly Ala Asp Glu Leu
65 70 75 80

Gly Glu Val Ile Lys Asp Asp Ile Trp Pro Asn Pro Leu Gln Tyr Tyr
85 90 95

Leu Val Pro Asp Met Asp Asp Glu Glu Gly Glu Gly Glu Glu Asp Asp
100 105 110

Asp Asp Asp Glu Glu Glu Glu Gly Leu Glu Asp Ile Ala Lys Lys Gly
115 120 125

Met Arg Met Lys Val Xaa Lys Xaa Lys Xaa Xaa Met Xaa Trp Glu Glu
130 135 140

Gly Gln Glu Asp
145



<210> 248

<211> 234

<212> PRT

<213> Homo sapien

<400> 248



Met Lys Met Val Ala Pro Trp Thr Arg Phe Tyr Ser Asn Ser Cys Cys
1 5 10 15

Leu Cys Cys His Val Arg Thr Gly Thr Ile Leu Leu Gly Val Trp Tyr
20 25 30

Leu Ile Ile Asn Ala Val Val Leu Leu Ile Leu Leu Ser Ala Leu Ala
35 40 45

Asp Pro Asp Gln Tyr Asn Phe Ser Ser Ser Glu Leu Gly Gly Asp Phe
50 55 60

Glu Phe Met Asp Asp Ala Lys Ile Leu Phe Asn Leu Ser Ala Asp Met
65 70 75 80

Cys Ile Ala Ile Ala Ile Ser Leu Leu Met Ile Leu Ile Cys Ala Met
85 90 95

Ala Thr Tyr Gly Ala Tyr Lys Gln Arg Ala Ala Trp Ile Ile Pro Phe
100 105 110

Phe Cys Tyr Gln Ile Phe Asp Phe Ala Leu Asn Met Leu Val Ala Ile
115 120 125

Thr Val Leu Ile Tyr Pro Asn Ser Ile Gln Glu Tyr Ile Arg Gln Leu
130 135 140

Pro Pro Asn Phe Pro Tyr Arg Asp Asp Val Met Ser Val Asn Pro Thr
145 150 155 160

Cys Leu Val Leu Ile Ile Leu Leu Phe Ile Ser Ile Ile Leu Thr Phe
165 170 175

Lys Gly Tyr Leu Ile Ser Cys Val Trp Asn Cys Tyr Arg Tyr Ile Asn
180 185 190

Gly Arg Asn Ser Ser Asp Val Leu Val Tyr Val Thr Ser Asn Asp Thr
195 200 205

Thr Val Leu Leu Pro Pro Tyr Asp Asp Ala Thr Val Asn Gly Ala Ala
210 215 220

Lys Glu Pro Pro Pro Pro Tyr Val Ser Ala
225 230



<210> 249 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 249 

ggccaactct ttttactcct tcatt 25 



<210> 250 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 250 

caggagcatc tccgttttca tt 22 



<210> 251 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 251 

tccagtagtt gggcagtgct ggca 24

<210> 252

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 252 

tcggcagaca tgtgcattg 19 



<210> 253 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 253 

cgttgcttgt acgctccgta a 21 



<210> 254 

<211> 33 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Synthetic 



<400> 254 

cattgcgatt tctcttctca tgatcctgat atg 33